RESULT 1 
AAB03792 

ID AAB03792 Standard; protein; 424 AA. 
XX 

AC AAB037 92; 
XX 

DT 13-OCT-2000 (first entry) 
XX 

DE Murine AMSH protein sequence. 
XX 

KW AMSH; signal transduction; SH3 domain; cytokine; STAM; severe infection; 

KW cancer; autoimmune disorder; mouse. 

XX 

OS Mus sp. 
XX 

PN WO200029436-A1. 

XX 

PD 25-MAY-2000. 
XX 

PF 12-NOV-1999; 99WO- JP006309 . 
XX 

PR 12-NOV-1998; 98 JP-00322674 . 
XX 

PA (MISC-) JAPAN SCI & TECHNOLOGY CORP. 
XX 

PI Sugamura K, Tanaka N; 
XX 

DR WPI; 2000-387743/33. 
DR N-PSDB; AAA59853 . 
XX 

PT Protein AMSH. being signal transducer interacting with SH3 domain of 

PT cytokine signal transducer STAM, for diagnosis and development of drugs 

PT of treating diseases e.g. severe infection, cancers and autoimmune 

PT disorders. 

XX 

PS Claim 7; Page 16-17; 22pp; Japanese. 
XX 

CC This sequence represents the murine AMSH protein. The AMSH protein is 
CC involved in signal transduction, and interacts with the SH3 domain of 
CC cytokine signal transducer STAM. The invention includes human and murine 
CC AMSH protein and gene sequences (hAMSH and mAMSH) , and also includes 
CC antibodies directed against hAMSH and mAMSH. The AMSH protein can be used 
CC in the diagnosis and development of drugs for treating diseases due to 
CC the dysfunction of cytokine signal transduction e.g. severe infection, 
CC cancers and autoimmune disorders 
XX 

SQ Sequence 424 AA; 

Query Match 100.0%; Score 2236; DB 3; Length 424; 

Best Local Similarity 100.0%; Pred. No . . 6 . 4e-178 ; 

Matches 424; Conservative 0; Mismatches 0; Indels 0; Gaps 0 

Qy 1 MSDHGDVSLPPQDRVRILSQLGSAVELNEDI PPRRYYRSGVEI IRMASVYSEEGNIEHAF 60 

MMIIIIIIIIMIIIIIIIIIIIIMIIMMMIIIIMIIIIIIIIIIMIIIIII 

Db 1 MSDHGDVSLPPQDRVRILSQLGSAVELNEDI PPRRYYRSGVEI IRMASVYSEEGNIEHAF 60 



Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 



ILYNKYITLFI EKLPKHRDYKSAI I PEKKDAVKKLKSVAFPKAEELKTELLRRYTKEYEQ 

1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 

ILYNKYITLFI EKLPKHRDYKSAI I PEKKDAVKKLKSVAFPKAEELKTELLRRYTKEYEQ 

YKERKKKEEEELARNIAIQQELEKEKQRVAQQKQKQLEQEQFPIAFEEMIQRQELEKERLK 

lllllllllllllllllllllllillllllllllllllllllllllllllllMllilll 

YKERKKKEEEELARNIAIQQELEKEKQRVAQQKQKQLEQEQFHAFEEMIQRQELEKERLK 
IVQEFGKVDPGPCGPLLPDLEKPCVDVAPSSPFSPTQTPDCNTGMRPAKPPWDRSLKPG 

illllllMlllllllllllllllllllllllllllllllllllllilllllllilllll 

IVQEFGKVDPGPCGPLLPDLEKPCVDVAPSSPFSPTQTPDCNTGMRPAKPPWDRSLKPG 



120 



120 



180 



180 



240 



240 



ALSVI ENVPTI EGLRHI WPRNLCSEFLQLASANTAKGI ETCGVLCGKLMRNEFTITHVL 3 00 

IIIIIIIIIIIIIIIIIMIIIIIIIIIIilllllMIIIIIIIIIIIIIIIIIIIIIII 

ALSVI ENVPTI EGLRHI WPRNLCSEFLQLASANTAKGI ETCGVLCGKLMRNEFTI THVL 3 00 
I PRQNGGPDYCHTENEEEI FFMQDDLGLLTLGWI HTHPTQTAFLSSVDLHTHCSYQMMLP 360 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiriiiiiiiiiiiiii-iiiiii 

I PRQNGGPDYCHTENEEEI FFMQDDLGLLTLGWI HTHPTQTAFLSSVDLHTHCS YQMMLP 360 

ESIAIVCSPKFQETGFFKLTDYGLQEISTCRQKGFHPHGRDPPLFCDCSHVTVKDRIVTI 420 

llllllllllllllllllllllllllllllllllllilllllllllllllllllllllll 
ESIAIVCSPKFQETGFFKLTDYGLQEISTCRQKGFHPHGRDPPLFCDCSHVTVKDRIVTI 420 



III 



AAB03791; 

13-OCT-2000 (first entry) 
Human AMSH protein sequence. 

AMSH; signal transduction; SH3 domain; cytokine; STAM; severe infection; 
cancer; autoimmune disorder; human. 

Homo sapiens . 

WO200029436-A1. 

25-MAY-2000. 

12-NOV-1999; 99WO- JP0063 09 . 
12-NOV-1998; 98 JP- 00322674 . 
(NISC-) JAPAN SCI & TECHNOLOGY CORP. 
Sugamura K, Tanaka N; 



DR WPI; 2000-387743/33. 

DR N-PSDB; AAA5 9852 . 
XX 

PT Protein AMSH being signal transducer interacting with SH3 domain of 

PT cytokine signal transducer STAM, for diagnosis and development of drugs 

PT of treating diseases e.g. severe infection, cancers and autoimmune 

PT disorders. 

XX 

PS Claim 1; Page 12-14; 22pp; Japanese. 
XX 

CC This sequence represents the human AMSH protein. The AMSH protein is 

GC involved in signal transduction, and interacts with the SH3 domain of 



CC cytokine signal transducer STAM. The invention includes human and murine 

CC AMSH protein and gene sequences {hAMSH and mAMSH) , and also includes 

CC antibodies directed against hAMSH and mAMSH. The AMSH protein can be used 

CC in the diagnosis and development of drugs for treating diseases due to 

CC the dysfunction of cytokine signal transduction e.g. severe infection, 

CC cancers and autoimmune disorders 

XX 

SQ Sequence 424 AA; 



Query Match 85.2%; Score 1904; DB 3; Length 424; 

Best Local Similarity ^83.5%; Pred. No. 3.4e-150; 

Matches 3 54; Conservative 35; Mismatches 35; Indels 0; Gaps 0 



Qy 


1 


MSDHGDVSLPPQDRVRILSQLGSAVELNEDIPPRRYYRSGVEIIRMASVYSEEGNIEHAF 60 

llllllllllhllll IIIMMIhllMllllhlllllllMlhlllllllllll 

MSDHGDVSLPPEDRVRALSQLGSAVEVNEDI PPRRYFRSGVE 1 1 RMAS I YSEEGNI EHAF 6 0 


Db 


1 


Qy 


61 


ILYNKYITLFIEKLPKHRDYKSAIIPEKKDAVKKLKSVAFPKAEELKTELLRRYTKEYEQ 120 

III II Ml III III MM Mill: INI II Mill Ml MM Ml IIIMIIIII : 

ILYNKYITLFI EKLPKHRDYKSAVI PEKKDTVKKLKEIAFPKAEELKAELLKRYTKEYTE 12 0 


Db 


61 


Qy 


121 


YKERKKKEEEELARNIAIQQELEKEKQRVAQQKQKQLEQEQFHAFEEMIQRQELEKERLK 180 

1 1 MM MMMMMMMMMiMMMMMMMMMMh MIIIMII 

YNEEKKKEAEELARNMAI QQELEKEKQRVAQQKQQQLEQEQFHAFEEMI RNQELEKERLK 180 


Db 


121 


Qy 


181 


IVQEFGKVDPGPCGPLLPDLEKPCVDVAPSSPFSPTQTPDCNTGMRPAKPPWDRSLKPG 24 0 

Mlllllilll llhllllll :|l h 1 1 ihl :|IMIII|||||||| 
IVQEFGKVDPGLGGPLVPDLEKPSLDVFPTLTVSSIQPSDCHTTVRPAKPPWDRSLKPG 240 


Db 


181 


Qy 


241 


ALS VI ENVPTI EGLRHI WPRNLCSEFLQLASANTAKG I ETCGVLCGKLMRNEFTI THVL 3 00 

1 II h M II M II 1 M II M M II II II II 1 M M II 1 M 1 II 1 II II M 1 M 1 1 

ALSNSESI PTIDGLRHVWPGRLCPQFLQLASANTARGVETCGILCGKLMRNEFTITHVL 3 00 


Db 


241 


Qy 


301 


IPRQNGGPDYCHTENEEEIFFMQDDLGLLTLGWIHTHPTQTAFLSSVDLHTHCSYQMMLP 360 

MM: 1 llhllMlhl :|| IhllllllllllllMIIIIIIIIMIIIIIII 
IPKQSAGSDYCNTENEEELFLIQDQQGLITLGWIHTHPTQTAFLSSVDLHTHCSYQMMLP 360 


Db 


301 


Qy 


361 


ESIAIVCSPKFQETGFFKLTDYGLQEISTCRQKGFHPHGRDPPLFCDCSHVTVKDRIVTI 42 0 

MMMMIMMMMMIhlhllhlMMMM :||llll llllll II III 

ESVAIVCSPKFQETGFFKLTDHGLEEISSCRQKGFHPHSKDPPLFCSCSHVTWDRAVTI 420 


Db 


361 


Qy 


421 


TDLR 424 


Db 


421 


MM 

TDLR 424 



RESULT 3 

AAB62193 
ID 
XX 



AAB62193 standard; protein; 424 AA. . Ifi'-V^ 



Smad associating polypeptide; SAP; cell cycle; TGF-beta; SAPl/AMSH; 
signal transduction; phosphorylation; gene therapy; human. 



>0i 

AC AAB62193; 
XX 

DT ll-JUN-2001 (first entry) 
XX 

DE Human SAPl/AMSH amino acid sequence. 
XX 
KW 
KW 
XX 

OS Homo sapiens . 
XX 

PN WO200121794-A2 . 
XX 

PD 2 9-MAR-2 001. 
XX 

PF 20-SEP-2000; 2000WO-US025790 . 
XX 

PR 20-SEP-1999; 99US-0154846P . 

XX 

PA (LUDW-) LUDWIG INST CANCER RES. 
XX 

PI Itoh F, Itoh S, Heldin C, Ten-Dijke P; 
XX 

DR WPI; 2001-257987/26. 

DR N-PSDB; AAF57478 . 
XX 

PT Novel Smad associating proteins useful for regulating cell cycle in a 

PT mammalian cell, and for identifying lead compounds for pharmacological 

PT agents. 
XX 

PS Claim 14; Page 53-55; 90pp; English. 
XX 

CC The invention relates to Smad associating polypeptides (SAP) ^ and nucleic 

CC acids encoding the SAPs . The SAPs can be expressed by standard 

CC recombinant methodology. The SAP proteins and polynucleotides are useful 

CC for regulating the cell cycle in a mammalian cell, and for modulating TGF 

CC -beta, superfamily signal transduction in a mammalian cell. SAP 

CC polypeptide are useful for testing Smad binding, for testing 

CC phosphorylation, for generating antibodies, and as components of an 

CC immunoassay. The SAP polynucleotides are useful in gene therapy. The 

CC present sequence represents the amino acid sequence of human SAPl/AMSH 

CC that can be used in modulating TGF-beta superfamily signal transduction 

CC in a mammalian cell 

XX 

SQ Sequence 424 AA; 

Query Match 85.2%; Score 1904; DB 4; Length 424; 

Best Local Similarity 83.5%; Pred. No. 3.4e-150; 

Matches 354; Conservative 35; Mismatches 35; Indels 0; Gaps 0 

Qy 1 MSDHGDVSLPPQDRVRILSQLGSAVELNEDIPPRRYYRSGVEIIRMASVYSEEGNIEHAF 60 

I i I I I I I I M i : I I I I 1 I I I I I I I I : I I I I I I I I I : I I I I I I I I I I I : I I I I I I I I I I I 



1 MSDHGDVSLPPEDRVRALSQLGSAVEVNEDIPPRRYFRSGVEIIRMASIYSEEGNIEHAF 60 

Qy 61 ILYNKYITLFIEKLPKHRDYKSAIIPEKKDAVKKLKSVAFPKAEELKTELLRRYTKEYEQ 120 

III! I IN I I Ml II II I II I I hi II I II Mill :|||llllll llhllllll : 
DD 61 ILYNKYITLFIEKLPKHRDYKSAVIPEKKDTVKKLKEIAFPKAEELKAELLKRYTKEYTE 120 

Qy 121 YKERKKKEEEELARNIAIQQELEKEKQRVAQQKQKQLEQEQFHAFEEMIQRQELEKERLK 180 

I I INIIIIIIhlllllllllllllllllhll Mill llllllh lllllllll 

Db 121 YNEEKKKEAEELARNMAIQQELEKEKQRVAQQKQQQLEQEQFHAFEEMI RNQELEKERLK 180 

Qy 181 IVQEFGKVDPGPCGPLLPDLEKPCVDVAPSSPFSPTQTPDCNTGMRPAKPPWDRSLKPG 240 

IIMIIIIIII llhllllll Ml h I I MM MUM II III Mill 

181 IVQEFGKVDPGLGGPLVPDLEKPSLDVFPTLTVSSIQPSDCHTTVRPAKPPWDRSLKPG 24 0 
Qy 241 ALSVIENVPTIEGLRHIWPRNLCSEFLQLASANTAKGIETCGVLCGKLMRNEFTITHVL 300 

III hMlhlllhlll II MllllllllhlMIIIMIIMIIIIIIIIIM 

241 ALSNSESIPTIDGLRHVWPGRLCPQFLQLASANTARGVETCGILCGKLMRNEFTITHVL 300 
Qy 301 IPRQNGGPDYCHTENEEEIFFMQDDLGLLTLGWIHTHPTQTAFLSSVDLHTHCSYQMMLP 360 

MHM Mhlllllhl Ml Ihlll III lllllllllllllll lllllllll I 

301 IPKQSAGSDYCNTENEEELFLIQDQQGLITLGWIHTHPTQTAFLSSVDLHTHCSYQMMLP 360 
Qy 361 ESIAIVCSPKFQETGFFKLTDYGLQEISTCRQKGFHPHGRDPPLFCDCSHVTVKDRIVTI 420 

IhllllllllltllllllllMIMIIMIIIIIIII MIIIM llllll M III 

361 ESVAIVCSPKFQETGFFKLTDHGLEEISSCRQKGFHPHSKDPPLFCSCSHVTWDRAVTI 420 
Qy 421 TDLR 424 

MM 

Db 421 TDLR 424 



RESULT 4 
ABG30998 



ID ABG30998 standard; peptide; 424 AA. 



AC ABG30998; 
XX 

DT 21-OCT-2002 {first entry) 

XX 

DE Human JAM domain containing protein #5 



XX 
KW 
KW 
KW 
KW 
KW 



JAM domian; peptidase activity; cytostatic; antiinf ective; 
antiinflammatory; antiasthmatic; vasotropic; neuroprotective; 
antirheumatic; ant ipsoriatic; deconjugation; JAB siobunit; infection; 
neoplastic growth; angiogenesis; chronic inflammation; asthma; ischaemia; 
reperfusion; multiple sclerosis; rheumatoid arthritis; psoriasis; 
KW JAM domain; human. 
XX 

OS Homo sapiens . 
XX 

PN WO200255669-A2 . 
XX 

PD 18-JUL-2002. 
XX 

PF 14-JAN-2002; 2002WO-US000975 . 
XX 

PR 12-JAN-2001; 2001US- 0261314P . 



PR 14-SEP-2001; 2001US~032203 OP . 
PR 14-SEP-2001; 2001US-0322322P . 
XX 

PA (CALY ) CALIFORNIA INST TECHNOLOGY, 
XX 

PI Lyapina SA, Verma R, Deshaies R; 
XX 



DR WPI; 2002-575431/61, 
XX 

PT Deconjugating a modifier protein from a target protein for treating e.g. 

FT asthma, ischemia, psoriasis, and rheumatoid arthritis, comprises 

PT contacting the target protein to a polypeptide comprising a JAB subunit. 

XX 

PS Disclosure; Fig 2; 50pp; English. 
XX 

CC The invention describes a methof of deconjugating a modifier protein from 

CC a target protein, where the modifier protein is conjugated to the target 

CC protein via a peptide bond between the carboxy terminus of the modifier 

CC protenin and a free amino group of the target protein, comprises 

CC contacting the target protein to a polypeptide comprising a subunit 

CC characterised as JAB subunit. Also disclosed are method for screening for 

CC an agent that affects the deconjugation, and to increase conjugation of 

CC the modifier protein to the target protein. The agent that affects 

CC deconjugation can be used to treat neoplastic growth, angiogenesis, 

CC infection, chronic inflammation, asthma, ischaemia and reperfusion, 

CC multiple sclerosis, rheumatoid arthritis, or psoriasis. This is the amino 

CC acid sequence of a protein containing the JAM domain responsible for 

CC peptidase activity 

XX 

SQ Sequence 424 AA; 



Query Match 85.2%; Score 1904; DB 5; Length 424; 

Best Local Similarity 83.5%; Pred. No. 3.4e-150; 

Matches 354; Conservative 35; Mismatches 35; Indels 0; Gaps 0; 



Qy 


1 


MSDHGDVSLPPQDRVRILSQLGSAVELNEDI PPRRYYRSGVEI IRMASVYSEEGNIEHAF 60 

llllllllllhllll llllllllhlMIIMIhlllllilMlhllllllllMI 

MSDHGDVSLPPEDRVRALSQLGSAVEVNEDI PPRRYFRSGVEI I RMAS I YSEEGNI EHAF 6 0 


Db . 


1 


Qy 


61 


ILYNKYITLFI EKLPKHRDYKSAI I PEKKDAVKKLKSVAFPKAEELKTELLRRYTKEYEQ 120 

II MM II 1 II II II II II IMhl Mill Mill Mllllllll MM MM II : 

ILYNKYITLFI EKLPKHRDYKSAVIPEKKDTVKKLKEIAFPKAEELKAELLKRYTKEYTE 12 0 


Db 


'61 


Qy 


121 


YKERKKKEEEELARNIAIQQELEKEKQRVAQQKQKQLEQEQFHAFEEMIQRQELEKERLK 180 

1 1 III! MlllhMIIIIIMIIIIIIIIhlllllllllirilh MUM Ml 

YNEEKKKEAEELARNMAIQQELEKEKQRVAQQKQQQLEQEQFHAFEEMIRNQELEKERLK 180 


Db 


121 


Qy 


181 


IVQEFGKVDPGPCGPLLPDLEKPCVDVAPSSPFSPTQTPDCNTGMRPAKPPWDRSLKPG 24 0 

IIMIIIIMI llhllllll --W h 1 1 Ihl :||||||||||||||| 

IVQEFGKVDPGLGGPLVPDLEKPSLDVFPTLTVSSIQPSDCHTTVRPAKPPWDRSLKPG 240 


Db 


181 


Qy 


241 


ALSVI ENVPTI EGLRHI WPRNLCSEFLQLASANTAKG I ETCGVLCGKLMRNEFTI THVL 3 00 

Ml hMlhlllhlll II MllllllllhhIllhlllllllMIIIIMI 

ALSNSES I PTI DGLRHVWPGRLCPQFLQLASANTARGVETCGI LCGKLMRNEFTI THVL 300 


Db 


241 


Qy 


301 


I PRQNGGPDYCHTENEEEI FFMQDDLGLLTLGWIHTHPTQTAFLSSVDLHTHCSYQMMLP 360 

IMh 1 llhlllMhl Ml IhllllllllMIMIIIIIIIMMMIMII 



Db 


301 


Dv 




Db 


361 


Qy 


421 


Db 


421 



301 I PKQSAGSDYCNTENEEELFLIQDQQGLITLGWIHTHPTQTAFLSSVDLHTHCSYQMMLP 360 

ESIAIVCSPKFQETGFFKLTDYGLQEISTCRQKGFHPHGRDPPLFCDCSHVTVKDRIVTI 420 

.M:||||||||||||||||||:||:|||:||||lllll HIIMI |||||| I I ||| 
ESVAIVCSPKFQETGFFKLTDHGLEEISSCRQKGFHPHSKDPPLFCSCSHVTWDRAVTI 420 

TDLR 424 

MM 



RESULT 5 
ABBS 10 14 

ID ABB81014 standard; peptide; 424 AA. 
XX 

S ™B81014; 



DT 21-OCT-2002 (first entry) 
XX 

DE Human AMSH protein sequence. 
XX 

?CW JAM; Jabl-associated metalloenzyme motif; peptidase; cytostatic; 

KW antimicrobial; antiinflammatory; antiasthmatic; vasotropic; AMSH; 

KW neuroprotective; antirheumatic; antiarthritic; antipsoriatic; human, 
XX 

OS Homo sapiens. 
XX 

PN WO200255536-A2. 
XX 

PD 18-JUL-2002. 
XX 

PF 14-JAN-2002; 2002WO-US000976 . 
XX 

PR 12-JAN-2001; 2001US-0261314P. 

PR 14-SEP-2001; 2001US-0322030P. 

PR 14-SEP-2001; 2001US- 0322322P . 
XX 

PA (CALY ) CALIFORNIA INST OF TECHNOLOGY. 

XX 

PI Cope G, Verma R, Aravind L, Koonin EV, Deshaies R; 
XX 

DR WPI; 2002-590657/63. 
XX 

PT New polypeptide, useful for screening for compounds for treating 

PT neoplastic growth, angiogenesis , chronic inflammation, asthma, ischemia, 

PT . multiple sclerosis, rheumatoid arthritis or psoriasis, comprises Jabl- 

PT associated metalloenzyme motif. 

XX 

PS Disclosure; Fig 2; 35pp; English. 
XX 

CC The invention relates to an isolated (crystalline) polypeptide (I) 

CC comprising a Jabl-associated metalloenzyme motif (JAM) domain, where the 

CC JAM domain is not adjacent to an amino acid sequence that is naturally 

CC adjacent to the domain. (I) has peptidase activity and can be used for 

CC modulation of neddylation/de-neddylation; ubiquit ination/de- 

CC ubiquitination and as C0P9/ signal some, 26S proteasome, AMSH, AMSHl or 

CC AMSH2 inhibitors. Inhibitor for (I) that will form a bond with the JAM 



\ 



CC domain and modulators of (I) are useful for treating neoplastic growth, 

CC angiogenesis, infection, chronic inflammation, asthma, ischaemia and 

CC reperfusion injury, multiple sclerosis, rheumatoid arthritis or 

CC psoriasis. The present sequence represents a human AMSH protein 
XX 

SQ Sequence 424 AA; 



Query Match 85.2%; Score 1904; DB 5; Length 424; 

Best Local Similarity 83.5%; Pred. No. 3.4e-150; 

Matches 354; Conservative 35; Mismatches 35; Indels 0; Gaps 0; 



Qy 


1 


MSDHGDVSLPPQDRVRILSQLGSAVELNEDIPPRRYYRSGVEIIRMASVYSEEGNIEHAF 

MIIIIIMIhllll llllllllhllllllllhlllillllllhlllllllllll 

MSDHGDVSLPPEDRVRALSQLGSAVEVNEDIPPRRYFRSGVEIIRMASIYSEEGNIEHAF 


60 


Db 


1 


60 


Qy 


61 


I LYNKYI TLFI EKLPKHRDYKSAII PEKKDAVKKLKSVAFPKAEELKTELLRRYTKE YEQ 

IIIIIIMIIIMIIIIIMIIhllMII Mill :|||||||ll llhllllll : 

ILYNKYITLFI EKLPKHRDYKSAVI PEKKDTVKKLKEIAFPKAEELKAELLKRYTKEYTE 


120 


Db 


61 


120 


Qy 


121 


YKERKKKEEEELARNIAIQQELEKEKQRVAQQKQKQLEQEQFHAFEEMIQRQELEKERLK 

1 1 MM lllllhlllMIIMIIIIIIMhIIIIMIIIIIIII: III II MM 

YNEEKKKEAEELARNMAIQQELEKEKQRVAQQKQQQLEQEQFHAFEEMIRNQELEKERLK 


180 


Db 


121 


180 


Qy 


181 


IVQEFGKVDPGPCGPLLPDLEKPCVDVAPSSPFSPTQTPDCNTGMRPAKPPWDRSLKPG 


240 


Db 


181 


MMIMIMI ll{:|||||| :|| h 1 1 Ihl :|||llllllllllll 
IVQEFGKVDPGLGGPLVPDLEKPSLDVFPTLTVSSIQPSDCHTTVRPAKPPWDRSLKPG 


240 


Qy 


241 


ALSVI ENVPTI EGLRHI WPRNLCSEFLQLASANTAKGI ETCGVLCGKLMRNEFTITHVL 

Ml h M 1 h II 1 h M 1 1 1 M 1 II 1 II II 1 : h 1 11 h M II II II II II II II 

ALSNSESIPTIDGLRHVWPGRLCPQFLQLASANTARGVETCGILCGKLMRNEFTITHVL 


300 


Db 


241 


300 


Qy 


301 


I PRQNGGPDYCHTENEEE I FFMQDDLGLLTLGWI HTHPTQTAFLSSVDLHTHCS YQMMLP 360 

MM: 1 IMMMMIM Ml II M 1 II II II II 1 II 1 II 1 II II 1 1 II II II 1 1 

IPKQSAGSDYCNTENEEELFLIQDQQGLITLGWIHTHPTQTAFLSSVDLHTHCSYQMMLP 360 


Db 


301 


Qy 


361 


ESIAIVCSPKFQETGFFKLTDYGLQEISTCRQKGFHPHGRDPPLFCDCSHVTVKDRIVTI 

MMIMIMI II MIIIIIIMIMIIMIIIIIIII Mill II llllll II III 

ESVAIVCSPKFQETGFFKLTDHGLEEISSCRQKGFHPHSKDPPLFCSCSHVTWDRAVTI 


420 


Db 


361 


420 


Qy 


421 


TDLR 424 




Db 


421 


TDLR 424 





RESULT 6 

ABW01032 



ID ABW01032 standard; protein; 424 AA. 
XX 

AC ABW01032; 
XX 

DT 15-JAN-2004 (first entry) 
XX 

DE Human AMSH protein. 

XX 

KW Jabl-associated metalloenzyme motif; JAMM domain; isopeptidase; asthma; 

KW drug design; proliferative disorder; inflammation; autoimmune disorder; 

KW rheumatoid arthritis; neuroprotective; C0P9 signalsome complex; CSN; 



KW E2 ubiquitin conjugating enzyme; degradation pathway; multiple sclerosis; 

KW angiogenesis; psoriasis; cytostatic; ischaemia; vasotropic; human. 

XX 

OS Homo sapiens. 
XX 

PN US2003153097-A1. 
XX 

PD 14-AUG-2003. 
XX 

PF 09-JAN-2003; 2003US -00340578 . 
XX 

PR 12-JAN-2001; 2001US-0261314P . 

PR 14-SEP-2001; 2001US-0322030P . 

PR 14-SEP-2001; 2001US-0322322P . 

PR 14-JAN-2002; 2002US-00046961 . 

PR 14-JAN-2002; 2002US-00047253 . 

PR 06-FEB-2002; 2002US- 0355334P . 
XX 

PA (DESH/) DESHAIES R J. 

PA (COPE/) COPE'G. 

PA (VERM/) VERMA R. . , 

PA (AMBR/) AMBROGGIO XI. 

XX 

PI Deshaies RJ, Cope G, Verma R, Ambroggio XI; 
XX 

DR WPI; 2003-811195/76, 
XX 

PT Isolated C0P9 signalsome isopeptidase comprising a mutant JAMM domain, 

PT useful for identifying the agents for treating cancer, autoimmune disease 

PT by modulating the peptidase activity. 
XX 

PS Example 3; Fig 1; Opp; English. 
XX 

CC The invention relates to an isolated polypeptide comprising a mutant JAMM 

CC (Jabl -associated metalloenzyme motif) domain. JAMM domain is useful for 

CC identifying an inhibitor of an isopeptidase activity of a polypeptide by 

CC rational drug design. JAMM is useful for treating proliferative disorder, 

CC inflammatory disorder or an autoimmune disorder. The cell proliferative 

CC disorder is a neoplastic growth or psoriasis. The inflammatory disorder 

CC is an acute infection or a chronic inflammatory disorder. The autoimmune 

CC disorder is multiple sclerosis or rheumatoid arthritis. The conditions is 

CC angiogenesis, asthma or ischaemia and reperfusion injury. The polypeptide 

CC comprising the JAMM domain is JABI . The polypeptide comprising the JAMM 

CC domain comprises a C0P9 signalsome (CSN) complex. The method further 

CC involves administering to the subject at least a second agent that 

CC modulates a component of a protein degradation pathway comprising the 

CC polypeptide comprising the JAMM domain. The second agent reduces or 

CC inhibits the activity of the component of the protein degradation 

CC pathway. The component of the protein degradation pathway comprises an E2 

CC ubiquitin conjugating enzyme or an E2 ubiquitin ligase. The E2 ubiquitin 

CC conjugating enzyme is Cdc34 or a Ubc4/5 family member. The E3 ubiquitin 

CC ligase comprises Skpl, Cull/Cdc53, F-box protein, or their combination, 

CC The E3 ubiquitin ligase comprises SCF ubiquitin ligase. The present 

CC sequence is human AMSH protein used in the exemplification of the 

CC invention 
XX 

SQ Sequence 424 AA; 



Query Match 85.2%; Score 1904; DB 7; Length 424; 

Best Local Similarity 83.5%; Pred. No. 3.4e-150; 

Matches 354; Conservative 35; Mismatches 35; Indels 0; Gaps 0; 

MSDHGDVSLPPQDRVRILSQLGSAVELNEDIPPRRYYRSGVEIIRMASVYSEEGNIEHAF 60 

Mllllllllhllll lllllllihilllllllhllllllllllhlllilllllll 

MSDHGDVSLPPEDRVRALSQLGSAVEVNEDIPPRRYFRSGVEIIRMASIYSEEGNIEHAF 60 
I LYNKYITLFI EKLPKHRDYKSAI I PEKKDAVKKLKSVAFPKAEELKTELLRRYTKEYEQ 120 

I II III II I Mill mill 1 1 hllll II Mill MM I MM I IIIMIIIII : 

ILYNKYITLFI EKLPKHRDYKSAVI PEKKDTVKKLKEIAFPKAEELKAELLKRYTKEYTE 120 

YKERKKKEEEELARNIAIQQELEKEKQRVAQQKQKQLEQEQFHAFEEMIQRQELEKERLK 180 

I I MM II III hllll II MUM II I llh MUM 11 Mill h IIIMIMI 

YNEEKKKEAEELARNMAIQQELEKEKQRVAQQKQQQLEQEQFHAFEEMIRNQELEKERLK 180 
IVQEFGKVDPGPCGPLLPDLEKPCVDVAPSSPFSPTQTPDCNTGMRPAKPPWDRSLKPG 240 

Ml Mill I II llhlMIII Ml h I I MM MIIIIIIIIIIIIM 

IVQEFGKVDPGLGGPLVPDLEKPSLDVFPTLTVSSIQPSDCHTTVRPAKPPWDRSLKPG 240 

ALSVIENVPTIEGLRHIWPRNLCSEFLQLASANTAKGIETCGVLCGKLMRNEFTITHVL 300 

Ml hMlhlllhlll II MllllllllhhIllhllllllllllllllll 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy • 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 



300 



MM: I llhlllllhl :|| IhlllllMMIIIIIIIIIIIIIIIIMIIII 

IPKQSAGSDYCNTENEEELFLIQDQQGLITLGWIHTHPTQTAFLSSVDLHTHCSYQMMLP 

ESIAIVCSPKFQETGFFKLTDYGLQEISTCRQKGFHPHGRDPPLFCDCSHVTVKDRIVTI 

:|||||| i||i|| || ||| 



RESULT 1 
Q9CQ26 

ID Q9CQ26 PRELIMINARY; PRT; 424 AA, 

AC Q9CQ26; 

DT Ol-JUN-2001 (TrEMBLrel. 17, Created) 

DT Ol-JUN-2001 (TrEMBLrel. 11, Last sequence update) 

DT Ol-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE 5330424L14Rik protein (5730422LllRik protein) (RIKEN cDNA 5730422L11 
DE gene) (AMSH) (Associated molecule With the SH3 domain of STAM) . 
GN STAMBP OR AMSH OR 533 0424L14RIK OR 5730422L11RIK OR MAMSH . 
OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
DC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus 

OX NCBI_TaxID=10090; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TI S SUE = Embryo , and Pituitary; 

RX MEDLINE-21085660; PubMed=11217851 ; 

RA Kawai J., Shinagawa A., Shibata K. , Yoshino M., Itoh M. , Ishii Y., 

RA Arakawa T. , Kara A,, Fukunishi Y., Konno H. , Adachi J., Fukuda S.! 

RA Aizawa K. , Izawa M. , Nishi K. , Kiyosawa H. , Kondo S., Yamanaka I.] 

RA Saito T., Okazaki Y. , Gojobori T., Bono H., Kasukawa T. , Saito R. 

RA Kadota K. , Matsuda H.A., Ashburner M., Batalov S., Casavant T. , 

RA Fleischmann W., Gaasterland T. , Gissi C. , King B., Kochiwa H./ 

RA Kuehl P., Lewis S., Matsuo Y., Nikaido I., Pesole G. , Quackenbush J. 

RA Schriml L.M. , Staubli F., Suzuki R. , Tomita M. , Wagner L. , Washio T. 

RA Sakai K. , Okido T. , Furuno M, , Aono H. , Baldarelli R., Barsh G. , . 

RA Blake J., Boffelli D,, Bojunga N, , Carninci P., de Bonaldo M.F.' 

RA Brownstein M.J., Bult C. , Fletcher C, Fujita M., Gariboldi M. / 

RA Gustincich S., Hill D. , Hofmann M. , Hume D.A. , Kamiya M. , Lee N.H,, 

RA Lyons P., Marchionni L., Mashima J., Mazzarelll J., Mombaerts P., 

RA Nordone P., Ring B., Ringwald M. , Rodriguez I., Sakamoto N. , 

RA Sasaki H. , Sato K. , Schoenbach C. , Seya T. , Shibata Y. , Storch K.-F, 

RA Suzuki^ H., Toyo-oka K. , Wang K.H., Weitz C, , Whittaker C. , Wilming L 

RA Wynshaw-Boris A., Yoshida K. , Hasegawa Y., Kawaji H., Kohtsuki S., 

RA Hayashizaki Y. ; 

RT "Functional, annotation of a full-length mouse cDNA collection."; 

RL Nature 409:685-690 (2001) . 
RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6; TISSUE=Brain ; 

RX MEDLINE=21570416; PubMed=11713295 ; 

RA Ishii N., Owada Y. , Yamada M. , Miura S., Murata K. , Asao H. , Kondo H, 

RA Sugamura K. ; 

RT "Loss of Neurons in the Hippocampus and Cerebral Cortex of AMSH- 

RT Deficient Mice."; 

RL Mol. Cell. Biol. 21:8626-8637(2001). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE-Breast tumor, and Liver; 

RA Strausberg R, ; 

RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AK019907; BAB3 1909.1; -. 

DR EMBL; AK017600; BAB30832.1; 

DR EMBL; BC003497; AAH03497.1; -. 

DR EMBL; BC006939; AAH06939.1; 

DR EMBL; AB010123; BAB78604,1; -. 



DR EMBL; BC025111; AAH25111.1; -. 

DR MGD; MGI: 1917777; Stambp . 

DR GO; GO: 0006916; P : ant i -apoptosis ; IMP. 

DR InterPro; IPR000555; Peptidase_M67 . 

DR Pfam; PF01398; Mov34; 1. 

DR SMART; SM00232; JAB_MPN; 1. 

SQ SEQUENCE 424 AA; 48513 MW; D3A9C2B5F65B7C5E CRC64 ; 



Query Match'" 100.0%; Score 2236; DB 11; Length 424; 

Best Local Similarity 100.0%; Pred. No. 6.8e-149; 

Matches 424; Conservative 0; Mismatches 0; Indels 0; Gaps 0 



Qy 


1 


MSDHGDVSLPPQDRVRILSQLGSAVELNEDI PPRRYYRSGVEI I RMASVYSEEGNI EHAF 6 0 

MIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIIIIIIMMIIMIIIMMIIMI 

MSDHGDVSLPPQDRVRILSQLGSAVELNEDIPPRRYYRSGVEIIRMASVYSEEGNIEHAF 60 


Db 


1 


Qy 


61 


ILYNKYITLFIEKLPKHRDYKSAIIPEKKDAVKKLKSVAFPKAEELKTELLRRYTKEYEQ 120 

MIIMMMIIIIMIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIillllllllllll 

ILYNKYITLFIEKLPKHRDYKSAI IPEKKDAVKKLKSVAFPKAEELKTELLRRYTKEYEQ 120 


Db 


61 


Qy 


121 


YKERKKKEEEELARNIAIQQELEKEKQRVAQQKQKQLEQEQFHAFEEMIQRQELEKERLK 180 

MMIIIIIIIIIIIMIIIMIIMIIIIIMIIIIIIIIIMIIIMIIIIIIIIMI 

YKERKKKEEEELARNIAIQQELEKEKQRVAQQKQKQLEQEQFHAFEEMIQRQELEKERLK 180 


Db 


121 


Ov 


J. O -L 


i. vyCir LjJ\ViJrLjrv^<^i-'ijljFUijlilvFL VUvArbbF PULN IGMRPAKPPWDRSLKPG 240 

IIIIIMIIIIIIIIilllllllllllillllllllllllllllllllllllllllliii 

IVQEFGKVDPGPCGPLLPDLEKPCVDVAPSSPFSPTQTPDCNTGMRPAKPPWDRSLKPG 24 0 


Db 


181 


Qy 


241 


ALSVI ENVPTI EGLRHI WPRNLCSEFLQLASANTAKGI ETCGVLCGKLMRNEFTI THVL 3 00 

MIIIIIIIIIMIMIIIMMIIMIIIIMIIIIIIIIIIIIIIIIIIIIIIMIII 

ALSVIENVPTIEGLRHIWPRNLCSEFLQLASANTAKGIETCGVLCGKLMRNEFTITHVL 3 00 


Db 


241 


Qy 


301 


IPRQNGGPDYCHTEMEEEIFFMQDDLGLLTLGWIHTHPTQTAFLSSVDLHTHCSYQMMLP 360 

IIIMIMIIIIIIIIIIIIIMIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIII 

I PRQNGGPDYCHTENEEEI FFMQDDLGLLTLGWI HTHPTQTAFLSSVDLHTHCSYQMMLP 360 


Db 


301 


Qy 


361 


ESIAIVCSPKFQETGFFKLTDYGLQEISTCRQKGFHPHGRDPPLFCDCSHVTVKDRIVTI 420 

IMMIMIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMMIllMIIIIIIM 

ESIAIVCSPKFQETGFFKLTDYGLQEISTCRQKGFHPHGRDPPLFCDCSHVTVKDRIVTI 420 


Db 


361 


Qy 


421 


TDLR 424 


Db 


421 


MM 

TDLR 424 



RESULT 2 
Q8R424 

ID Q8R424 PRELIMINARY; PRT; 424 AA. 

AC Q8R424; 

DT Ol-JUN-2002 (TrEMBLrel . 21, Created) 
DT Ol-JUN-2002 (TrEMBLrel. 21, Last sequence update) 
DT Ol-OCT-2003 (TrEMBLrel. 25, Last annotation update) 
DE AMSH. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 



RN [1] 

RP SEQUENCE FROM N,A. 

RA Pawlak A., Guellaen G. ; 

RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases, 

DR EMBL; AY083159; AAL92520.1; -. 

DR InterPro; IPR000555; Pept idase_M67 . 

DR Pfam; PF01398; Mov34; 1. 

DR SMART; SM00232; JAB_MPN; 1. 

SQ SEQUENCE 424 AA; 48511 MW; 90BE083ADACF4B44 CRC64; 



Query Match 98.1%; Score 2193; DB 11; Length 424; 

Best Local Similarity 97.6%; Pred. No. 7.1e-146; 

Matches 414; Conservative 7; Mismatches 3; Indels 0; Gaps 0; 



Qy 


1 


MSDHGDVSLPPQDRVRILSQLGSAVELNEDIPPRRYYRSGVEIIRMASVYSEEGNIEHAF 

MM MMMMMMMMMMMMMMMhMMMMMhMMMMMI 

MSDHADVSLPPQDRVRILSQLGSAVELNEDI PPRRYFRSGVEI IRMASI YSEEGNIEHAF 


60 


Db 


1 


60 


Qy 


61 


I LYNKY I TLF I EKLPKHRDYKSAI I PEKKDAVKKLKSVAFPKAEELKTELLRRYTKEYEQ 

MMMMMMMMMMMMMMMMMMMMMMMMMhMMMM 

ILYNKYITLFI EKLPKHRDYKSAI I PEKKDAVKKLKNVAFPKAEELKTELLKRYTKEYEQ 


120 


Db 


61 


120 


Qy 


121 


YKERKKKEEEELARNIAIQQELEKEKQRVAQQKQKQLEQEQFHAFEEMIQRQELEKERLK 

MMMMMMMMMMMMMMMMMMMMMMMMMHMMMM 

YKERKKKEEEELARNIAIQQELEKEKQRVAQQKQKQLEQEQFHAFEKMIQKQELEKERLK 


180 


Db 


121 


180 


Qy 


181 


IVQEFGKVDPGPCGPLLPDLEKPCVDVAPSSPFSPTQTPDCNTGMRPAKPPWDRSLKPG 

MMMMMMMMMMMMMMMMMMM MM MMMMMMMM 

IVQEFGKVDPGPCGPLLPDLEKPCVDVAPSSPFSPTQTSDCNTTLRPAKPPWDRSLKPG 


240 


Db 


* 181 


240 


Qy 


241 


ALSVI ENVPTI EGLRHI WPRNLCSEFLQLASANTAKGI ETCGVLCGKLMRNEFTI THVL 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

ALSVI ENVPTI EGLRHI WPRNLCSEFLQLASANTAKGI ETCGVLCGKLMRNEFTI THVL 


300 


Db 


241 


300 


Qy 
Db 


301 
301 


I PRQNGGPDYCHTENEEEI FFMQDDLGLLTLGWI HTHPTQTAFLSSVDLHTHCSYQMMLP 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

I PRQNGGPDYCHTENEEEI FFMQDDLGLLTLGWI HTHPTQTAFLSSVDLHTHCSYQMMLP 


360 
360 


Qy 


361 


ES I AI VCS PKFQETGFFKLTDYGLQE I STCRQKGFHPHGRDPPLFCDCSHVTVKDR I VTI 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

ESIAIVCSPKFQETGFFKLTDYGLQEISTCRQKGFHPHGRDPPLFCDCSHVTVKDRIVTI 


420 


Db 


361 


420 


Qy 


421 


TDLR 424 

MM 

TDLR 424 




Db 


421 





RESULT 1 

E35542 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 

COMMENT 



linear PAT 31-JAN-2 002 



E35542 1384 bp DNA 

Protein AMSH and cDNA thereof. 
E35542 

E35542 .1 GI : 18624563 
JP 2000139469-A/2. 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1384) 
Sugamura,K. and Tanaka,N. 
Protein AMSH and cDNA thereof 
Patent: JP 2000139469-A 2 23-MAY-2000; 
SCIENCE & TECH AGENCY 
OS Homo sapiens (human) 
JP 2000139469-A/2 
23-MAY-2000 

12-NOV-1998 JP 1998322674 



PN 
PD 
PF 
PR 
PI 
PC 
PC 
PC 
PC 

cc 

FH 
FT 



FEATURES 

source 



ORIGIN 



KAZUO SUGAMURA,NOBUYUKI TANAKA 

C12N15/09,C07K14/52,C07K16/24//C12Nl/21,C12P2l/02, {C12N15/09, 
C12R1:91) , 

(C12N1/21,C12R1:19) , {C12P21/02 , C12R1 : 19) ,C12N15/00, (C12N15/00 
C12R1:91) 

Key Location/Qualifiers 
CDS 56. .1327. 

Location/Qualifiers 

1. .1384 

/organism="Homo sapiens" 
/mol_type="genomic DNA" 
/db__xref="taxon: 9606" 



Query Match 100.0%; Score 1384; DB 6; Length 1384; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1384; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 GTGACGTTTCCGGAAGCTCTGACTGTCATCCTTCACGAAAGAACTTATTTGTCCAATGTC 60 

1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 GTGACGTTTCCGGAAGCTCTGACTGTCATCCTTCACGAAAGAACTTATTTGTCCAATGTC 60 
Qy 61 TGACCATGGGGATGTGAGCCTCCCACCCCAAGACCGGGTGAGGATTCTGTCCCAACTTGG 120 

, M M I M I M 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M I M 1 1 1 1 

Db 61 TGACCATGGGGATGTGAGCCTCCCACCCCAAGACCGGGTGAGGATTCTGTCCCAACTTGG 120 

Qy 121 GAGTGCAGTTGAGTTAAATGAAGACATTCCACCCCGTCGCTACTACCGCTCCGGTGTTGA 180 

,,11111111111 IIIMIIIIMIMIIIIIIIIIIIIMIIIIIIIMIIIIIIIlllll 

Db 121 GAGTGCAGTTGAGTTAAATGAAGACATTCCACCCCGTCGCTACTACCGCTCCGGTGTTGA 180 

Qy 181 GATCATCCGCATGGCGTCCGTTTACTCGGAAGAAGGCAACATTGAACATGCCTTTATCCT 24 0 

IIIIIMIIIIIIIIIIMIIIIIIIIIIIIIIIMIIIIIMIIIIIIIIIIMIIIII 

Db 181 GATCATCCGCATGGCGTCCGTTTACTCGGAAGAAGGCAACATTGAACATGCCTTTATCCT 24 0 



Qy 



241 CTACAACAAGTACATCACGCTGTTTATTGAAAAACTTCCGAAACACCGAGACTACAAATC 3 00 



Db 


241 


IMIIIMMIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMlll 

CTACAACAAGTACATCACGCTGTTTATTGAAAAACTTCCGAAACACCGAGACTAO^ 


300 


Qy 


301 


AGCTATCATTCCTGAGAAGAAAGATGCTGTCAAGAAATTAAAGAGCGTCGCTTTCCCTAA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ,,,, 

IMMMIMMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIM 

AGCTATCATTCCTGAGAAGAAAGATGCTGTCAAGAAATTAAAGAGCGTCGCTTTCCCTAA 


360 


Db 


301 


360 


Qy 


361 


AGCGGAAGAGCTGAAGACAGAGCTCTTGAGAAGATACACCAAAGAATATGAGCAGTATAA 

llllllllltlElllllllllltiitiiiiiiiiiii t 

MMIMMIMIMIIIIIIMIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIM 

AGCGGAAGAGCTGAAGACAGAGCTCTTGAGAAGATACACCAAAGAATATGAGCAGTATAA 


420 


Db 


361 


420 


Qy 


421 


AGAGCGAAAGAAAAAGGAAGAAGAGGAACTTGCCCGAAATATCGCCATCCAGCAAGAGTT 

1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AGAGCGAAAGAAAAAGGAAGAAGAGGAACTTGCCCGAAATATCGCCATCCAGCAAGAGTT 


480 


Db 


421 


480 


Qy 


481 


GGAAAAAGAAAAACAGAGGGTTGCTCAGCAGAAGCAGAAGCAGCTAGAGCAGGAGCAATT 

IIIMMMIIIIMIIIMIMIMIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIII 

GGAAAAAGAAAAACAGAGGGTTGCTCAGCAGAAGCAGAAGCAGCTAGAGCAGGAGCAATT 


540 


Db 


481 


540 


Qy 


541 


CCATGCCTTTGAGGAGATGATCCAGAGGCAGGAGCTGGAAAAAGAACGGCTGAAAATTGT 

MMMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIIMIIIIII 

CCATGCCTTTGAGGAGATGATCCAGAGGCAGGAGCTGGAAAAAGAACGGCTGAAAATTGT 


600 


Db 


541 


600 


Qy 


601 


TCAAGAGTTCGGGAAGGTAGACCCTGGCCCCTGCGGGCCTCTGCTCCCTGATCTGGAAAA 

1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 { 1 1 

TCAAGAGTTCGGGAAGGTAGACCCTGGCCCCTGCGGGCCTCTGCTCCCTGATCTGGAAAA 


660' 


Db 


601 


660 


Qy 


661 


GCCTTGTGTAGATGTGGCCCCCAGCTCACCGTTCTCGCCCACGCAGACTCCAGACTGTAA 

MIIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIMINIIIIIIIILIMIIMIII 

GCCTTGTGTAGATGTGGCCCCCAGCTCACCGTTCTCGCCCACGCAGACTCCAGACTGTAA 


720 


Db 


661 


720 


Qy 


721 


CACAGGCATGAGGCCAGCTAAGCCACCTGTGGTGGACAGGTCCCTGAAACCTGGAGCGTT 

t 1 1 1 1 1 1 1 1 1 1 1 1 J 1 1 1 f 1 1 1 1 1 1 1 I 1 1 1 I 1 I ■ 1 I I 1 1 . 1 t t , 


780 


Db 


721 


M 1 M M M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CACAGGCATGAGGCCAGCTAAGCCACCTGTGGTGGACAGGTCCCTGAAACCTGGAGCGTT 


780 


Qy 


781 


AAGCGTCATAGAAAATGTTCCCACCATTGAAGGCCTGCGCCACATCGTGGTGCCCCGTAA 

IMIIIIIMIIIIIIMIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIII 

AAGCGTCATAGAAAATGTTCCCACCATTGAAGGCCTGCGCCACATCGTGGTGCCCCGTAA 


840 


Db 


781 


840 


Qy 


841 


TCTGTGCTCAGAATTTCTCCAGCTTGCCAGTGCCAATACCGCCAAAGGCATTGAAACCTG 

Mllllll III III III Mill MIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIII 

TCTGTGCTCAGAATTTCTCCAGCTTGCCAGTGCCAATACCGCCAAAGGCATTGAAACCTG 


900 


Db 


841 


900 


Qy 


901 


TGGAGTCCTCTGTGGAAAACTGATGAGAAATGAATTCACAATCACACATGTTCTCATCCC 


960 


Db 


901 


IIMIIIIIII lllllllllll llllllllllllllllllllllllllllll Mllllll 

TGGAGTCCTCTGTGGAAAACTGATGAGAAATGAATTCACAATCACACATGTTCTCATCCC 


960 


Qy 


961 


CAGACAAAATGGTGGGCCTGATTATTGCCACACGGAGAATGAAGAAGAAATTTTCTTTAT 


1020 


Db 


961 


IIMMIIMI Mill Mill III MM II Mllllll IIMIII III Mill IIMMI 

CAGACAAAATGGTGGGCCTGATTATTGCCACACGGAGAATGAAGAAGAAATTTTCTTTAT 


1020 


Qy 


1021 


GCAGGATGACCTTGGACTCCTCACTCTTGGCTGGATCCATACTCATCCAACCCAAACGGC 

MMMMMMMM Mllllll Mllllll MM II Mllllll MM! II IIMMI 

GCAGGATGACCTTGGACTCCTCACTCTTGGCTGGATCCATACTCATCCAACCau^ 


1080 


Db 


1021 


1080 


Qy 


1081 


CTTTCTGTCCAGTGTGGATCTCCACACTCACTGCTCCTACCAAATGATGTTACCAGAGTC 

MMMMMMMM IIIIIIMMIIIIIMIIIIIIIIIIIIMIIIIMMIIIII 


1140 



Db 



1081 CTTTCTGTCCAGTGTGGATCTCCACACTCACTGCTCCTACCAAATGATGTTACCAGAGTC 1140 



Qy 


1141 


CATCGCAATCGTCTGTTCCCCAAAGTTCCAGGAAACTGGATTCTTTAAGCTAACTGACTA 

IIMIIIIIIIIIIMIIIIIIIIIIMMIIIIIMIIIIIIMMIIIIMIIIIIII 

CATCGCAATCGTCTGTTCCCCAAAGTTCCAGGAAACTGGATTCTTTAAGCTAACTGACTA 


1200 


Db 


1141 


1200 


Qy 


1201 


TGGTCTTCAAGAGATTTCAACCTGCCGGCAGAAAGGCTTTCACCCCCATGGCAGAGACCC 

MIIIIIIIIIIMIIIIIIIIIMIIIIIIIillllllllllMIIIIIIIIIIMIil 

TGGTCTTCAAGAGATTTCAACCTGCCGGCAGAAAGGCTTTCACCCCCATGGCAGAGACCC 


1260 


Db 


1201 


1260 


Qy 


1261 


ACCGCTGTTCTGTGACTGCAGCCATGTCACTGTCAAGGACAGAATTGTGACGATCACAGA 

IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIMIIIIIIIIIIII 

ALLbLTbTTLTGTGACTGCAGCCATGTCACTGTCAAGGACAG^ 


1320 


uD 


1 O C T 

IZo 1 


1320 


Qy 


1321 


CCTTCGATAAATCTCAAATCATGAACCAGGGAGATGGATCACTGGGTAACAGCACTTGTC 

IIIIIIIIIMIIIIIIIIIIMIIIIIIIIIIMIIIIIMIIIIIIIIIIIIIIMM 

CCTTCGATAAATCTCAAATCATGAACCAGGGAGATGGATCACTGGGTAACAGCACTTGTC 


1380 


Db 


1321 


1380 


Qy 


1381 


ACCA 1384 

MM 

ACCA 1384 




Db 


1381 





RESULT 2 

AB010123 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



AB010123 2104 bp mRNA linear ROD 06-DEC-2001 

Mus musculus mRNA for AMSH, complete cds . 

AB010123 

AB010123 .1 GI : 17385633 
AMSH. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 

Ishii,N., Owada,Y., Yamada,M. , Miura,S., Murata, K. , Asao, H. , 
Kondo , H . and Sugamura , K . 

Loss of neurons in the hippocampus and cerebral cortex of 
AMSH-def icient mice 

Mol. Cell. Biol. 21 (24)-, 8626-8637 (2001) 

21570416 
11713295 

2 (bases 1 to 2104) 

Ishii,N., Aizawa,K. and Sugamura, K. 

Direct Submission 

Submitted {06-JAN-1998) Kazuo Sugamura, Tohoku University School of 
Medicine, Department of Microbiology and Immunology; 2-1 
Seiryo-machi, Aoba-ku, Sendai, Miyagi 980-8575, Japan 
(E-mail : sugamura@mail . cc . tohoku. ac . jp, Tel : 81-22-717-8096, 
Fax: 81-22-717-8097) 

Location/Qualifiers 

1. .2104 

/organ ism= "Mus musculus" 
/ mol_type= "mRNA" 
/strain="C57BL/6" 
/db_xref="taxon: 10090" 
/tissue_type= "brain" 



gene 1, .2104 

/gene=" mamsh." 
CDS 56. .1330 

/gene=" mamsh" 



/note="associated molecule with the SH3 domain of STAM" 

/codon_start=l 

/product ="AMSH" 

/protein_id="BAB78604 .1" 

/db_xref="GI : 17385634" 

/translation="MSDHGDVSLPPQDRVRILSQLGSAVELNEDIPPRRYYRSGVEII 
RMASVYSEEGNI EHAFILYNKYITLFI EKLPKHRDYKSAI I PEKKDAVKKLKSVAFPK 
AEELKTELLRRYTKEYEQYKERKKKEEEELARNIAIQQELEKEKQRVAQQKQKQLEQE 
QFHAFEEMIQRQELEKERLKIVQEFGKVDPGPCGPLLPDLEKPCVDVAPSSPFSPTQT 
PDCNTGMRPAKPPWDRSLKPGALSVIENVPTIEGLRHIWPRNLCSEFLQLASANTA 
KGI ETCGVLCGKLMRNEFTI THVLI PRQNGGPDYCHTENEEEI FFMQDDLGLLTLGWI 
HTHPTQTAFLSS VDLHTHCS YQMMLPES I AI VCS PKFQETGFFKLTDYGLQE I STCRQ 
KGFHPHGRDPPLFCDCSHVTVKDRI VTI TDLR " 



ORIGIN 








Query Match 100.0%; Score 1384; DB 10; Length 2104; 
Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1384; Conservative 0; Mismatches 0; Indels 0; Gaps 


0; 




■t 
1 


GTGACGTTTCCGGAAGCTCTGACTGTCATCCTTCACGAAAGAACTTATTTGTCCAATGTC 

1 1 1 11 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

u I LjALU i i i CLULiAAuL 1 L i LjAL 1 Lr i LA 1 LL i i vJALUAAAbAAL I i A i I i b i LLAATGTL 


60 


UD 


T 

X 


60 


Qy 


61 


TGACCATGGGGATGTGAGCCTCCCACCCCAAGACCGGGTGAGGATTCTGTCCCAACTTGG 

lllhlllllllllllllilllllllllllllllllilllllllllllillllllllllll 

TGACCATGGGGATGTGAGCCTCCCACCCCAAGACCGGGTGAGGATTCTGTCCCAACTTGG 


120 


Db 


61 


120 


Qy 


121 


GAGTGCAGTTGAGTTAAATGAAGACATTCCACCCCGTCGCTACTACCGCTCCGGTGTTGA 

IIIIIIIIIIIMIIINIIIIIIIIIIIIIIIIIIIIIIIIIIIMMIIIIIIIIIII 

GAGTGCAGTTGAGTTAAATGAAGACATTCCACCCCGTCGCTACTACCGCTCCGGTGTTGA 


180 


Db 


121 


ion 


Qy 


181 


GATCATCCGCATGGCGTCCGTTTACTCGGAAGAAGGCAACATTGAACATGCCTTTATCCT 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIII 
GATCATCCGCATGGCGTCCGTTTACTCGGAAGAAGGCAACATTGAACATGCCTTTATCCT 


240 


Db 


181 


240 


Qy 


241 


CTACAACAAGTACATCACGCTGTTTATTGAAAAACTTCCGAAACACCGAGACTACAAATC 

IIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIM 

CTACAACAAGTACATCACGCTGTTTATTGAAAAACTTCCGAAACACCGAGACTACAAATC 


300 


Db 


241 


300 


Qy 


301 


AGCTATCATTCCTGAGAAGAAAGATGCTGTCAAGAAATTAAAGAGCGTCGCTTTCCCTAA 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIilllilllllllll 

AGCTATCATTCCTGAGAAGAAAGATGCTGTCAAGAAATTAAAGAGCGTCGCTTTCCCTAA 


360 


Db 


301 


360 


Qy 


361 


AGCGGAAGAGCTGAAGACAGAGCTCTTGAGAAGATACACCAAAGAATATGAGCAGTATAA 

IIIIIMIMIIIIIIIIIIIIIMMIIMIIIMIIIIIIIIIIIIIIIIIIIIIIII 

AGCGGAAGAGCTGAAGACAGAGCTCTTGAGAAGATACACCAAAGAATATGAGCAGTATAA 


420 


Db 


361 


420 


Qy 


421 


AGAGCGAAAGAAAAAGGAAGAAGAGGAACTTGCCCGAAATATCGCCATCCAGCAAGAGTT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 

AGAGCGAAAGAAAAAGGAAGAAGAGGAACTTGCCCGAAATATCGCCATCCAGCAAGAGTT 


480 


Db 


421 


480 


Qy • 


481 


GGAAAAAGAAAAACAGAGGGTTGCTCAGCAGAAGCAGAAGCAGCTAGAGCAGGAGCAATT 
llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 


540 



Db 481 GGAAAAAGAAAAACAGAGGGTTGCTO^GCAGAAGCAGAAGCAGCTAGAGCAGGAGG^ 540 

Qy 541 CCATGCCTTTGAGGAGATGATCCAGAGGCAGGAGCTGGAAAAAGAACGGCTGAAAATTGT 600 

IIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
Db 541 CCATGCCTTTGAGGAGATGATCCAGAGGCAGGAGCTGGAAAAAGAACGGCTGAAAATTGT 600 

Qy 601 TCAAGAGTTCGGGAAGGTAGACCCTGGCCCCTGCGGGCCTCTGCTCCCTGATCTGGAAAA 660" 

IIIIIIIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
Db 601 TCAAGAGTTCGGGAAGGTAGACCCTGGCCCCTGCGGGCCTCTGCTCCCTGATCTGGAAAA 660 

Qy 661 GCCTTGTGTAGATGTGGCCCCCAGCTCACCGTTCTCGCCCACGCAGACTCCAGACTGTAA 720 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIMIMIIII 

Db 661 GCCTTGTGTAGATGTGGCCCCCAGCTCACCGTTCTCGCCCACGCAGACTCCAGACTGTAA 72 0 

Qy 721 CACAGGCATGAGGCCAGCTAAGCCACCTGTGGTGGACAGGTCCCTGAAACCTGGAGCGTT 78 0 

lllllllillllllllllllllllllllllllllllllllllllllllMIIIIIIIIII 

Db 721 CACAGGCATGAGGCCAGCTAAGCCACCTGTGGTGGACAGGTCCCTGAAACCTGGAGCGTT 780 

Qy 781 AAGCGTCATAGAAAATGTTCCCACCATTGAAGGCCTGCGCCACATCGTGGTGCCCCGTAA 84 0 

llllliMlllllllillllllMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 781 AAGCGTCATAGAAAATGTTCCCACCATTGAAGGCCTGCGCCACATCGTGGTGCCCCGTAA 84 0 

Qy 841 TCTGTGCTCAGAATTTCTCCAGCTTGCCAGTGCCAATACCGCCAAAGGCATTGAAACCTG 900 

IIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII 
Db 841 TCTGTGCTCAGAATTTCTCCAGCTTGCCAGTGCCAATACCGCCAAAGGCATTGAAACCTG 900 

Qy 901 TGGAGTCCTCTGTGGAAAACTGATGAGAAATGAATTCACAATCACACATGTTCTCATCCC 960 

IIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 901 TGGAGTCCTCTGTGGAAAACTGATGAGAAATGAATTCACAATCACACATGTTCTCATCCC 960 

Qy 961 CAGACAAAATGGTGGGCCTGATTATTGCCACACGGAGAATGAAGAAGAAATTTTCTTTAT 1020 

IIIIIIMI Mill IIIIIIIIIIIM II MINI II Mill IIIIIIIIIIIIIIIMI 

Db 961 CAGACAAAATGGTGGGCCTGATTATTGCCACACGGAGAATGAAGAAGAAATTTTCTTTAT 1020 

Qy 1021 GCAGGATGACCTTGGACTCCTCACTCTTGGCTGGATCCATACTCATCCAACCCAAACGGC 1080 

llllll II IIIIMIII III llllllllllllll IIIIIIMI MM III llllllllll 

Db 1021 GCAGGATGACCTTGGACTCCTCACTCTTGGCTGGATCCATACTCATCCAACCCAAACGGC 108 0 

Qy 1081 CTTTCTGTCCAGTGTGGATCTCCACACTCACTGCTCCTACCAAATGATGTTACCAGAGTC 114 0 

llllll II MM II III Ml IMIIIIIIMI II III llllllllll MM llllll I II 

Db 1081 CTTTCTGTCCAGTGTGGATCTCCACACTCACTGCTCCTACCAAATGATGTTACCAGAGTC 114 0 

Qy 1141 CATCGCAATCGTCTGTTCCCCAAAGTTCCAGGAAACTGGATTCTTTAAGCTAACTGACTA 1200 

I II II 1 1 1 II II 1 1 M II M I M II I II II 1 1 1 II II II II I II 1 1 II II I II I II 1 1 II 

Db 1141 CATCGCAATCGTCTGTTCCCCAAAGTTCCAGGAAACTGGATTCTTTAAGCTAACTGACTA 1200 

Qy 1201 TGGTCTTCAAGAGATTTCAACCTGCCGGCAGAAAGGCTTTCACCCCCATGGCAGAGACCC 1260 

llllll II MM II llllll llllllllllll III II Mill Mill llllllllll I II 

Db 1201 TGGTCTTCAAGAGATTTCAACCTGCCGGCAGAAAGGCTTTCACCCCCATGGCAGAGACCC 1260 

Qy 1261 ACCGCTGTTCTGTGACTGCAGCCATGTCACTGTCAAGGACAGAATTGTGACGATCACAGA 1320 

I Mill MM MM llllll I Mill I II III III II Mill Mill llllllllll III 

Db 1261 ACCGCTGTTCTGTGACTGCAGCCATGTCACTGTCAAGGACAGAATTGTGACGATCACAGA 132 0 

Qy 1321 CCTTCGATAAATCTCAAATCATGAACCAGGGAGATGGATCACTGGGTAACAGCACTTGTC 1380 

llllllllll MM llllll lllllll II I IIIIIJMIIMIIIII III Mlllll III 

Db 1321 CCTTCGATAAATCTCAAATCATGAACCAGGGAGATGGATCACTGGGTAACAGCACTTGTC 1380 



Qy 1381 ACCA 1384 

Db 1381 ACCA 1384 



RESULT 3 
BC025111 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REMARK 
COMMENT 



Stapleton.M. , 
Scheetz,T.E. , 
Carninci, P. , 
Abramson , R . D . 
McKernan, K. J . 
Worley,K.C. , 



BC025111 2094 bp mRNA linear ROD 12-NOV-2003 

Mus musculus Stam binding protein, mRNA (cDNA clone MGC: 36212 
IMAGE:4194332) , complete cds . 
BC025111 

BC025111 .1 GI : 19263555 
MGC . 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi / 
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

1 (bases 1 to 2094) 

Strausberg,R.L, , Feingold, E . A. , Grouse, L.H. , Derge, J.G. , 
Klausner,R.D. , Collins,F.S . , Wagner, L., Shenmen , C . M , , Schuler , G . D. , 
Altschul,S.F. , Zeeberg,B., Buetow^K.H., Schaef er, C . F . , Bhat,N.K., 
Hopkins , R . F . , Jordan, H., Moore, T., Max,S.I., Wang, J., Hsieh,F., 
Diatchenko,L. , Marusina,K., Farmer, A. A. , Rubin, G.M., Hong,L., 
Scares , M . B . , Bonaldo , M . F . , Casavant , T . L . , 
Brownstein,M. J. , Usdin,T.B. , Toshiyuki,S. , 
Prange,C. , Raha,S.S. , Loquellano, N . A . , Peters, G.J. , 
, Mullahy,S. J. , Bosak,S.A,, McEwan,P.J., 
, Malek,J.A., Gunaratne, P.H. , Richards, S., 
Hale,S., Garcia, A.M., Gay,L.J., -Hulyk,S.W., 
Villalon,D.K. , Muzny,D.M., Sodergren , E . J . , Lu,X, , Gibbs,R.A., 
Fahey, J. , Helton, E. , Ketteman,M. , Madan,A. , Rodrigues,S. , 
Sanchez, A., Whiting, M. , Madan,A. , Young, A. C, Shevchenko , Y . , 
Bouf fard,G.G, , Blakesley, R.W. , Touchman , J . W . , Green, E.D, , 
Dickson,M.C. , Rodriguez , A . C , , Grimwood,J., Schmutz,J., Myers, R.M,> 
Butterf ield, Y.S. , Krzywinski ,M. I . , Skalska,U. , Smailus,D.E. , 
Schnerch,A. , Schein,J.E., Jones, S.J. and Marra,M.A. 
Generation and initial analysis of more than 15,000 full-length, 
human and mouse cDNA sequences 

Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) 

22388257 

12477932 

2 (bases 1 to 2094) 
Strausberg, R, 
Direct Submission 

Submitted ( 05-MAR-2 002 ) National Institutes of Health, Mammalian 
Gene Collection (MGC), Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 
USA 

NIH-MGC Project URL: http://mgc.nci.nih.gov 

Contact : MGC help desk 

Ema i 1 : cgapbs - r@ma i 1 . n ih . gov 

Tissue Procurement: Jeffrey E. Green, M.D. 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Baylor College of Medicine Human Genome 

Sequencing Center 



Center code: BCM-HGSC 

Web site: http : //www . hgsc . bcm, tmc . edu/cdna/ 
Contact : amg@bcm . tmc . edu 

Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H., 
Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati, 
A.N., Gibbs, R.A. 

Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consort ium/LLNL at: http://image.llnl.gov 
Series: IRAK Plate: 61 Row: j Column: 3 

This clone was selected for full length sequencing because it 
passed the following selection criteria: matched mRNA gi : 17941276. 
FEATURES Location/Qualifiers 
source l. .2 094 

/organism="Mus musculus" 
/mol_type="mRNA" 
/strain="FVB/N" 
/db_xref-"taxon: 10090" 
/clone="MGC: 36212 IMAGE: 4194332" 

/tissue_type=" Liver, normal. 5 month old male mouse." 

/ c 1 one_l ib = " NCI _CGAP_L 19" 

/ 1 ab_hos t = " DH 1 0 B " 

/note=" Vector : pCMV-SP0RT6" 
gene 1. .2094 

/gene-"Stambp " 

/note=" synonym: Amsh" 

/db_xref="LocuslD: 70527" 

/db_xref="MGI : 1917777" 
CDS 51. .1325 

/codon_start=l 

/product ="Stam binding protein" 
/protein_id="AAH25111. 1" 

/db_xref="GI : 19263556" 
/db_xref="LocusID: 70527" 

/translation="MSDHGDVSLPPQDRVRILSQLGSAVELNEDIPPRRYYRSGVEII 
RMASVYSEEGNIEHAFILYNKYITLFIEKLPKHRDYKSAI I PEKKDAVKKLKSVAFPK 
AEELKTELLRRYTKEYEQYKERKKKEEEELARNIAIQQELEKEKQRVAQQKQKQLEQE 
QFHAFEEMIQRQELEKERLKIVQEFGKVDPGPCGPLLPDLEKPCVDVAPSSPFSPTQT 
PDCNTGMRPAKPPWDRSLKPGALSVI ENVPTI EGLRHI WPRNLCSEFLQLASANTA 
KGI ETCGVLCGKLMRNEFTITHVLI PRQNGGPDYCHTENEEEI FFMQDDLGLLTLGWI 
HTHPTQTAFLSSVDLHTHCSYQMMLPESIAIVCSPKFQETGFFKLTDYGLQEISTCRQ 
KGFHPHGRDPPLFCDCSHVTVKDRIVTITDLR" 
misc_feature 804. .1133 

/note="Mov34; Region: Mov34/MPN/PAD-l family. Members of 
this family are found in proteasome regulatory subunits, 
eukaryotic initiation factor 3 {eIF3) subunits and 
regulators of transcription factors. This family is also 
known as the MPN domain and PAD-l-like domain. It has been 
shown that this domain occurs in prokaryotes" 
/db_xref = " CDD : pf amO 13 9 8 " 

ORIGIN 

Query Match 99.6%; Score 1379; DB 10; Length 2094; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1379; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 



6 GTTTCCGGAAGCTCTGACTGTCATCCTTCACGAAAGAACTTATTTGTCCAATGTCTGACC 65 



Db 


1 


MlllllllllllllllllllllllllilllllllMMIIIIIIIIIIlllMIIIIII 

GTTTCCGGAAGCTCTGACTGTCATCCTTCACGAAAGAACTTATTTGTCCAATGTCTGACC 


60 


Qy 


66 


ATGGGGATGTGAGCCTCCCACCCCAAGACCGGGTGAGGATTCTGTCCCAACTTGGGAGTG 

lllllllllllllllllllllllllllllilllliMllllllllllllllljlllllll 

ATGGGGATGTGAGCCTCCCACCCCAAGACCGGGTGAGGATTCTGTCCCAACTTGGGAGTG 


125 


Db 


61 


120 


Qy 


126 


CAGTTGAGTTAAATGAAGACATTCCACCCCGTCGCTACTACCGCTCCGGTGTTGAGATCA 

MIMIIIIIIIIMIIIIIIMIIIIMIIIIIIIIIIIIIIIMIIMIIIIIMIM 

CAGTTGAGTTAAATGAAGACATTCCACCCCGTCGCTACTACCGCTCCGGTGTTGAGATCA 


185 


Db 


121 


180 


Qy 


186 


TCCGCATGGCGTCCGTTTACTCGGAAGAAGGCAACATTGAACATGCCTTTATCCTCTACA 

1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TCCGCATGGCGTCCGTTTACTCGGAAGAAGGCAACATTGAACATGCCTTTATCCTCTACA 


245 


Db 


181 


240 


Qy 


246 


ACAAGTACATCACGCTGTTTATTGAAAAACTTCCGAAACACCGAGACTACAAATCAGCTA 

IIIIMIIIIIIIIMIIIIIIIIIIIMIIIMIIIMIIIIIIIIIIIIIMIIIIM 

ACAAGTACATCACGCTGTTTATTGAAAAACTTCCGAAACACCGAGACTACAAATCAGCTA 


305 


Db 


241 


300 


Qy 


306 


TCATTCCTGAGAAGAAAGATGCTGTCAAGAAATTAAAGAGCGTCGCTTTCCCTAAAGCGG 

NIMIIIIIIIIIMIIIINIIMIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIII 

TCATTCCTGAGAAGAAAGATGCTGTCAAGAAATTAAAGAGCGTCGCTTTCCCTAAAGCGG 


365 


Db 


301 


360 


Qy 


366 


AAGAGCTGAAGACAGAGCTCTTGAGAAGATACACCAAAGAATATGAGCAGTATAAAGAGC 

MIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIII 

AAGAGCTGAAGACAGAGCTCTTGAGAAGATACACCAAAGAATATGAGCAGTATAAAGAGC 


425 


Db 


361 


420 


Qy 


426 


GAAAGAAAAAGGAAGAAGAGGAACTTGCCCGAAATATCGCCATCCAGCAAGAGTTGGAAA 

IIIIIIIMMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIM 

GAAAGAAAAAGGAAGAAGAGGAACTTGCCCGAAATATCGCCATCCAGCAAGAGTTGGAAA 


485 


Db 


421 


480 


Qy 


486 


AAGAAAAACAGAGGGTTGCTCAGCAGAAGCAGAAGCAGCTAGAGCAGGAGCAATTCCA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AAGAAAAACAGAGGGTTGCTCAGCAGAAGCAGAAGCAGCTAGAGCAGGAGCAATTCCATG 


545 


Db 


481 


540 


Qy 


546 


CCTTTGAGGAGATGATCCAGAGGCAGGAGCTGGAAAAAGAACGGCTGAAAATTGTTCAAG 

IIIIIIIIMIIMIIIIIIIIIIIIIIMIIIIIIilllllMIIIIIIIIIMIIIII 

CCTTTGAGGAGATGATCCAGAGGCAGGAGCTGGAAAAAGAACGGCTGAAAATTGTTCAAG 


605 


Db 


541 


600 


Qy 


606 


AGTTCGGGAAGGTAGACCCTGGCCCCTGCGGGCCTCTGCTCCCTGATCTGGAAAAGCCTT 

III II MINI Mill III MINI MINI II MM II Mill II II III III INI II 

AGTTCGGGAAGGTAGACCCTGGCCCCTGCGGGCCTCTGCTCCCTGATCTGGAAAAGCCTT 


665 


Db 


601 


660 


Qy 


666 


GTGTAGATGTGGCCCCCAGCTCACCGTTCTCGCCCACGCAGACTCCAGACTGTAACACAG 

IIIMIMIIIMMIIIIIIIMIIIIMIMIMMIMMIIMIIIIIMIIMII 

GTGTAGATGTGGCCCCCAGCTCACCGTTCTCGCCCACGCAGACTCCAGACTGTAACACAG 


725 


Db 


661 


720 


Qy 


726 


GCATGAGGCCAGCTAAGCCACCTGTGGTGGACAGGTCCCTGAAACCTGGAGCGTTAAGCG 


785 


Db 


721 


IIMIMIIIMMMIIMIIMIIIIIIIMIIMIIMMMIIIIIIIIIIIIIII 

GCATGAGGCCAGCTAAGCCACCTGTGGTGGACAGGTCCCTGAAACCTGGAGCGTTAAGCG 


780 


Qy 


786 


TCATAGAAAATGTTCCCACCATTGAAGGCCTGGGCCACATCGTGGTGCCCCGTAATCTGT 

IIIIMMMIIIMMMMIIIIIIIMMIIMIIIIIIIIIIMMIIMIIIIII 

TCATAGAAAATGTTCCCACCATTGAAGGCCTGCGCCACATCGTGGTGCCCCGTAATCTGT 


845 


Db 


781 


840 


Qy 


846 


GCTCAGAATTTCTCCAGCTTGCCAGTGCCAATACCGCCAAAGGCATTGAAACCTGTGGAG 

1 II M M M M M M M M M M M M M M M M M M M M M M j M M M M M M 


905 



Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



841 GCTCAGAATTTCTCCAGCTTGCCAGTGCCAATACCGCCAAAGGCATTGAAACCTGTGGAG 900 



906 



901 



966 



961 



1026 



TCCTCTGTGGAAAACTGATGAGAAATGAATTCACAATCACACATGTTCTCATCCCCAGAC 

IMIIIIMIIIIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIIMIMMIIIIIII 

TCCTCTGTGGAAAACTGATGAGAAATGAATTCACAATCACACATGTTCTCATCCCCAGAC 
AAAATGGTGGGCCTGATTATTGCCACACGGAGAATGAAGAAGAAATTTTCTTTATGCAGG 

IMMIIIMMIIIIIMIilllllMIIIIMMMMliMIIIIIMIIIIIIIII 

AAAATGGTGGGCCTGATTATTGCCACACGGAGAATGAAGAAGAAATTTTCTTTATGCAGG 



ATGACCTTGGACTCCTCACTCTTGGCTGGATCCATACTCATCCAACCCAAACXIJGCCTTTC 

MIIIIIIIIIIIIIIMIIIIIIIIIIMIIIIIIMMIIIIIIMIIIMIIIMM 

1021 ATGACCTTGGACTCCTCACTCTTGGCTGGATCCATACTCATCCAACCCAAACGGCCTTTC 



965 



960 



1025 



1020 



1085 



1080 



1086 TGTCCAGTGTGGATCTCCACACTCACTGCTCCTACCAAATGATGTTACCAGAGTCCATCG 1145 

lllllilMIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

1081 TGTCCAGTGTGGATCTCCACACTCACTGCTCCTACCAAATGATGTTACCAGAGTCC^^ 1140 
1146 CAATCGTCTGTTCCCCAAAGTTCCAGGAAACTGGATTCTTTAAGCTAACTGACTATGGTC 1205 

IMIIIIMIIMIIIIIIIIIMIIIIIMIIIIIMMIIIMIIIIIIIIIIIIIII 

1141 CAATCGTCTGTTCCCCAAAGTTCCAGGAAACTGGATTCTTTAAGCTAACTGACTATGGTC 1200 

1206 TTCAAGAGATTTCAACCTGCCGGCAGAAAGGCTTTCACCCCCATGGCAGAGACCCACCGC 1265 

I I M I I I I I I I I I I I i M I M I M I I I I I I I I I I I I I I I I I 1 1 1 1 1 1 1 1 1 { 1 1 1 1 1 1 1 1 1 
1201 TTCMlGAGATTTCAACCTGCCGGCAGAAAGGCTTTCACCCCCATGGCAGAGACCCACCGC 1260 

1266 TGTTCTGTGACTGCAGCCATGTCACTGTCAAGGACAGAATTGTGACGATCACAGACCTTC 1325 

IIMIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMIIIIilll'llllllll 

1261 TGTTCTGTGACTGCAGCCATGTCACTGTCAAGGACAGAATTGTGACGATCACAGACCTTC 132 0 



1384 



132 6 GATAAATCTCAAATCATGAACCAGGGAGATGGATCACTGGGTAACAGCACTTGTCACCA 

1 1 M-l 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 { 1 1 1 1 1 1 { 1 1 1 1 

1321 GATAAATCTCAAATCATGAACCAGGGAGATGGATCACTGGGTAACAGCACTTGTCACCA 1379 



RESULT 4 
BC003497 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



mRNA linear ROD 03-OCT-2003 
mRNA (cDNA clone MGC:6953 



BC003497 2122 bp 

Mus musculus Stam binding protein, 
IMAGE:3153829) , complete cds . 
BC003497 

BC0034 97 .1 GI : 13097542 
MGC. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Rodentia; 
1 (bases 1 to 2122) 
Strausberg^R.L. , Feingold, E. A. , Grouse, L.H., Derge, J.G. , 
Klausner,R.D. , Coll ins , F . S . , Wagner, L., Shenmen , C . M . , Schuler , G . D, , 
Altschul,S.F. , 2eeberg,B., Buetow^K.H., Schaefer, C. F, , Bhat,N.K., 
Hopkins, R. F. , Jordan, H., Moore, T., Max,S.I., Wang, J. , Hsieh,F., 
Marusina,K., Farmer, A. A. , Rubin, G.M., Hong,L., 
Scares, M.B., Bonaldo, M . F . , Casavant , T, L . , 
Brownstein,M. J, , Usdin,T.B. , Toshiyuki , S . , 
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A. , Peters, G. J., 
Abramson,R.D. , Mullahy, S - J . , Bosak,S.A., McEwan,P.J., 



Craniata ; Vertebrata ; Euteleostomi ; 
Sciurognathi; Muridae; Murinae; Mus. 



Diatchenko, L. 

Stapleton, M. , 
Scheet2,T.E. , 



McKernan,K. J. , Malek,J.A., Gunaratne, P . H. , Richards, S., 
Worley,K.C., Hale,S., Garcia, A. M. , Gay,L,J., Hulyk,S.W., 
Villalon,D.K. , Muzny,D.M., Sodergren, E . J . , Lu,X., Gibbs,R.A., 
Fahey,J., Helton,E., Ketteman.M. , Ma dan, A. , Rodrigues , S . , 
Sanchez, A., Whiting, M., Madan,A. , Young, A. C, Shevchenko , Y . , 
Bouf fard,G.G. , Blakesley , R . W. , Touchman , J . w . , Green, E.D. , 
Dickson,M.C. , Rodriguez , A. C. , Grimwood,J., Schmutz,J,, Myers, R.M. 
Butterf ield, Y.S, , Krzywinski,M. I , , Skalska,U. , Smailus , D. E . , 
Schnerch,A., Schein,J.E., Jones, S.J. and Marra,M.A. 

TITLE Generation and initial analysis of more than 15,000 full-length 
human and mouse cDNA sequences 

JOURNAL Proc. Natl. Acad.Sci. U.S.A. 99 (26), 16899-16903 (2002) 

MEDLINE 22388257 
PUBMED 12477932 
REFERENCE 2 (bases 1 to 2122) 

AUTHORS St rausberg , R , 

TITLE Direct Submission 

JOURNAL Submitted (20-FEB-2001) National Institutes of Health, Mammalian 
Gene Collection (MGC) , Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 
USA 

REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov 
COMMENT Contact: MGC help desk 

Email : cgapbs-r@mail .nih.gov 

Tissue Procurement: Lothar Hennighausen Ph.D., Chu-Xia Deng Ph.D. 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consort-ium (LLNL) 

DNA Sequencing by: Baylor College of Medicine Human Genome 

Sequencing Center 

Center code: BCM-HGSC 

Web site: http://www.hgsc.bcm.tmc.edu/cdna/ 
Contact ; amg@bcm . tmc . edu 

Gunaratne, P.H., Garcia, ' A.M. , Lu, X., Hulyk, S.W. 
Kowis, C.R., Sneed, A.J,, Martin, R.G., Muzny, D.M 
A.N,, Gibbs, R.A. 



Louis eged, H. , 
Nanavati, 



Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consort ium/LLNL at: http://image.llnl.gov 
Series: IRAK Plate: 5 Row: p Column: 22 

This clone was selected for full length sequencing because it 
passed the following selection criteria: matched mRNA gi: 17941276. 
FEATURES Location/Qualifiers 
source . 1. .2122 

/organism="Mus musculus" 
/mol_type="mRNA" 
/strain="129, C57BL/6J,FVB/N" 
/db_xref="taxon: 10090" 
/clone="MGC:6953 IMAGE: 3153829" 

/tissue_type="Mammary tumor. Brcal-/fl; MMTV-Cre model. 10 
months old, gross tissue." 
/clone_lib="NCI_CGAP_Mam3 " 
/lab_host="DH10B" 
/note-" Vector: pCMV-SP0RT6" 
gene 1. .2122 

/gene="Stambp" 
/note=" synonym: Amsh" 
/ db_xr e f = " Lo cus I D : 7 0 5 2 7 " 



/db_xref ="MGI : 1917777" 
CDS 52, .1326 

/codon_start=l 

/product="Stam binding protein" 
/protein_id="AAH034 97 . 1" 
/db_xref-"GI : 13097543" 
/db_xref="LocusID: 70527" 

/trans 1 at ion= " MSDHGDVSLPPQDRVRI LSQLGSAVELNEDI PPRRYYRSGVEI I 
RMASVYSEEGNI EHAFI LYNKYITLFI EKLPKHRDYKSAI I PEKKDAVKKLKSVAFPK 
AEELKTELLRRYTKEYEQYKERKKKEEEELARNIAIQQELEKEKQRVAQQKQKQLEQE 
QFHAFEEMIQRQELEKERLKIVQEFGKVDPGPCGPLLPDLEKPCVDVAPSSPFSPTQT 
PDCNTGMRPAKPPWDRSLKPGALSVI ENVPTI EGLRHI WPRNLCSEFLQLASANTA 
KGIETCGVLCGKLMRNEFTITHVLIPRQNGGPDYCHTENEEEIFFMQDDLGLLTLGWI 
HTHPTQTAFLSSVDLHTHCSYQMMLPESIAIVCSPKFQETGFFKLTDYGLQEISTCRQ 
KGFHPHGRDPPLFCDCSHVTVKDRIVTITDLR" 
misc_feature 805. .1134 

/note="Mov34; Region: Mov34/iyiPN/PAD-l family. Members of 
this family are found in proteasome regulatory subunits, 
eukaryotic initiation factor 3 {eIF3) subunits and 
regulators of transcription factors. This family is also 
known as the MPN domain and PAD-l-like domain. It has been 
shown that this domain occurs in prokaryotes" 
/ db_xr e f = " ODD : p f amO 1398" 

ORIGIN 



Query Match 98.8%; Score 1367.6; DB 10; Length 2122; 

Best Local Similarity 99.7%; Pred. No. 0; 

Matches 1370; Conservative 0; Mismatches 4; Indels 0; Gaps 0'; 



Qy 


11 


CGGAAGCTCTGACTGTCATCCTTCACGAAAGAACTTATTTGTCCAATGTCTGACCATGGG 


70 






II llllllllllllllllllllllllllllllllllllllllllllillllllllll 




Db 


7 


CGTCCGCTCTGACTGTCATCCTTCACGAAAGAACTTATTTGTCCAATGTCTGACCATGGG 


66 


Qy 


71 


GATGTGAGCCTCCCACCCCAAGACCGGGTGAGGATTCTGTCCCAACTTGGGAGTGCAGTT 


130 






IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIM'IIIIIIIIIIII.111111111111111 




Db 


67 


GATGTGAGCCTCCCACCCCAAGACCGGGTGAGGATTCTGTCCCAACTTGGGAGTCCAGTT 


126 


Qy 


131 


GAGTTAAATGAAGACATTCCACCCCGTCGCTACTACCGCTCCGGTGTTGAGATCATCCGC 


190 






IIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIilillll 




Db 


127 


GAGTTAAATGAAGACATTGCACCCCGTCGCTACTACCGCTCCGGTGTTGAGATCATCCGC 


186 


Qy 


191 


ATGGCGTCCGTTTACTCGGAAGAAGGGAACATTGAACATGCCTTTATCCTCTACAACAAG 


250 






IIIIIIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIi 




Db 


187 


ATGGCGTCCGTTTACTCGGAAGAAGGCAACATTGAACATGCCTTTATCCTCTACAACAAG 


246 


Qy - 


251 


TACATCACGCTGTTTATTGAAAAACTTCCGAAACACCGAGACTACAAATCAGCTATCATT 


310 






MllllillilllllllMMIMIIIIIillllllllllllMlllllllillllMM 




Db 


247 


TACATCACGCTGTTTATTGAAAAACTTCCGAAACACCGAGACTACAAATCAGCTATCATT 


306 


Qy 


311 


CCTGAGAAGAAAGATGCTGTCAAGAAATTAAAGAGCGTCGCTTTCCCTAAAGCGGAAGAG 


370 






IIIMIIIIIIIIIIII IIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIilllllMI 




Db 


307 


CCTGAGAAGAAAGATGCCGTCAAGAAATTAAAGAGCGTCGCTTTCCCTAAAGCGGAAGAG 


366 


Qy 


371 


CTGAAGACAGAGCTCTTGAGAAGATACACCAAAGAATATGAGCAGTATAAAGAGCGAAAG 


430 






IIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


367 


CTGAAGACAGAGCTCTTGAGAAGATACACCAAAGAATATGAGCAGTATAAAGAGCGAAAG 


426 



Qy 


431 


AAAAAGGAAGAAGAGGAACTTGCCCGAAATATCGCCATCCAGCAAGAGTTGGAAAAAGAA 


490 






IIIIIIIIIIIIMIIIIIIIIIillllllllllllllllllllllllllllilllllll 




Db 


427 


AAAAAGGAAGAAGAGGAACTTGCCCGAAATATCGCCATCCAGCAAGAGTTGGAAAAAGAA 


486 


Qy 


491 


AAACAGAGGGTTGCTCAGCAGAAGCAGAAGCAGCTAGAGCAGGAGCAATfcCATGCCTTT 


550 






IIIIIIIMIIIIMMMMIIIIIIMMIIIIIIIIIIIIIIIIIMIIIIIIIIM 




Db 


487 


AAACAGAGGGTTGCTCAGCAGAAGCAGAAGCAGCTAGAGCAGGAGCAATTCCATGCCTTT 


546 


Qy 


551 


GAGGAGATGATCCAGAGGCAGGAGCTGGAAAAAGAACGGCTGAAAATTGTTCAAGAGTTC 


610 






IIIIIMIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIilll 




Db 


547 


GAGGAGATGATCCAGAGGCAGGAGCTGGAAAAAGAACGGCTGAAAATTGTTCAAGAGTTC 


606 


Qy 


611 


GGGAAGGTAGACCCTGGCCCCTGCGGGCCTCTGCTCCCTGATCTGGAAAAGCCTTGTGTA 


670 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M M 1 1 1 1 1 1 i 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 




Db 


607 


GGGAAGGTAGACCCTGGCCCCTGCGGGCCTCTGCTCCCTGATCTGGAAAAGCCTTGTGTA 


666 


Qy 


671 


GATGTGGCCCCCAGCTCACCGTTCTCGCCCACGCAGACTCCAGACTGTAACACAGGCATG 


730 • 






llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 




Db 


667 


GATGTGGCCCCCAGCTCACCGTTCTCGCCCACGCAGACTCCAGACTGTAACACAGGCATG 


726 


Qy 


731 


AGGCCAGCTAAGCCACCTGTGGTGGACAGGTCCCTGAAACCTGGAGCGTTAAGCGTCATA 


790 






illilllllllllMlllllilllllllllllllllllllllMIIIIIIIIIIIIIIII 




Db 


727 


AGGCCAGCTAAGCCACCTGTGGTGGACAGGTCCCTGAAACCTGGAGCGTTAAGCGTCATA 


786 


Qy 


791 


GAAAATGTTCCCACCATTGAAGGCCTGCGCCACATCGTGGTGCCCCGTAATCTGTGCTCA 


850 






lllliillllllllllllllllllillllllllilllMlilllillllllllllillll 




Db 


787 


GAAAATGTTCCCACCATTGAAGGCCTGCGCCACATCGTGGTGCCCCGTAATCTGTGCTCA 


846 


Qy 


851 


GAATTTCTCCAGCTTGCCAGTGCCAATACCGCCAAAGGCATTGAAACCTGTGGAGTCCTC 


910 






Mill lllllllillllllllllllll III llllllllllllllllllllllllllllll 




Db 


847 


GAATTTCTCCAGCTTGCCAGTGCCAATACCGCCAAAGGCATTGAAACCTGTGGAGTCCTC 


906 


Qy 


911 


TGTGGAAAACTGATGAGAAATGAATTCACAATGACACATGTTCTCATCCCCAGACAAAAT 


970 






IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIillliillil 




Db 


907 


TGTGGAAAACTGATGAGAAATGAATTCACAATCACACATGTTCTCATCCCCAGACAAAAT 


966 


Qy 


971 


GGTGGGCCTGATTATTGCCACACGGAGAATGAAGAAGAAATTTTCTTTATGCAGGATGAC 


1030 






lllllllllllllllllllllllllillllllllllllllllllllllllllllllllll 




Db 


967 


GGTGGGCCTGATTATTGCCACACGGAGAATGAAGAAGAAATTTTCTTTATGCAGGATGAC 


1026 


Qy 


1031 


CTTGGACTCCTCACTCTTGGCTGGATCCATACTCATCCAACCCAAACGGCCTTTCTGTCC 


1090 






Nil mill III mill II Mini Mi lllllilMIII Mill III Mill UNI 




Db 


1027 


CTTGGACTCCTCACTCTTGGCTGGATCCATACTCATCCAACCCAAACGGCCTTTCTGTCC 


1086 


Qy 


1091 


AGTGTGGATCTCCACACTCACTGCTCCTACCAAATGATGTTACCAGAGTCCATCGCAATC 


1150 






IIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


1087 


AGTGTGGATCTCCACACTCACTGCTCCTACCAAATGATGTTACCAGAGTCCATCGCAATC 


1146 


Qy 


1151 


GTCTGTTCCCCAAAGTTCCAGGAAACTGGATTCTTTAAGCTAACTGACTATGGTCTTCAA 


1210 






iiiiiiiiiiiiiiiiiimMiiiiiiiiiiMiiiiiiiiiMiiiiiiiiimii 




Db 


1147 


GTCTGTTCCCCAAAGTTCCAGGAAACTGGATTCTTTAAGCTAACTGACTATGGTCTTCAA 


1206 


Qy 


1211 


GAGATTTCAACCTGCCGGCAGAAAGGCTTTCACCCCCATGGCAGAGACCCACCGCTGTTC 


1270 


Db 


1207 


imiiiMiiiiiimiiiiiiiiiiiiiimiiiiiiiiiiiiiiiiiiiiiiiii 

GAGATTTCAACCTGCCGGCAGAAAGGCTTTCACCCCCATGGCAGAGACCCACCGCTGTTC 


1266 



Qy 1271 TGTGACTGCAGCCATGTCACTGTCAAGGACAGAATTGTGACGATCACAGACCTTCGATAA 133 0 



MMIMIIIMIIIIMIIIIIIIMIIIMMIIIIIIMIMIIIIIIIIIIIIIII 

Db 1267 TGTGACTGCAGCCATGTCACTGTCAAGGACAGAATTGTGACGATCACAGACCTTC^^ 1326 

Qy 1331 ATCTCAAATCATGAACCAGGGAGATGGATCACTGGGTAACAGCACTTGTCACCA 1384 

MIIIIIMIIIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIII 

Db 1327 ATCTCAAATCATGAACCAGGGAGATGGATCACTGGGTAACAGCACTTGTCACCA 1380 



RESULT 5 
BC006939 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REMARK 
COMMENT 



BC006939 2172 bp mRNA linear ROD 03-OCT-2003 

Mus musculus Stam binding protein, mRNA (cDNA clone MGC:6839 
IMAGE:2649718) , complete cds . 
BC006939 

BC006939. 1 GI : 13905279 
MGC, 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

1 (bases 1 to 2172) 

Strausberg,R.L. , Feingold, E . A. , Grouse, L.H., Derge,J.G., 
Klausner,R,D. , Collins , F. S . , Wagner, L., Shenmen , C . M . , Schuler.G. D. , 
Altschul,S.F. , Zeeberg,B., Buetow,K.H., Schaef er, C. F. , Bhat,N.K., 
Hopkins.R.F. , Jordan, H., Moore, T., Max,S.I. , Wang, J., Hsieh,F,, 
Diatchenko,L. , Marusina,K., Farmer, A. A, , Rubin, G. M. , Hong,L., 
Stapleton,M. , Scares, M.B., Bonaldo,M. F . , Casavant ,T.L. , 
Scheetz,T.E. , Brownstein, M . J . , Usdin,T.B., Toshiyuki , S . , 
Carninci,P., Prange,C,, Raha,S.S., Loquellano,N,A. , Peters, G. J., 
Abramson,R.D. , Mullahy , S . J . , Bosak,S.A., McEwan,P.J., 
McKernan,K. J. , Malek,J.A., Gunaratne, P . H . , Richards, S., 
Worley,K,C., Hale,S,, Garcia, A.M., Gay,L.J., Hulyk,S.W., 
Villalon,D.K. , Mu2ny,D.M., Sodergren, E . J . , Lu,X- , Gibbs,R.A., 
Fahey,J., Helton,E., Ketteman,M., Madan,A. , Rodrigues, S . , 
Sanchez, A., Whiting, M., Madan.A. , Young, A. C, Shevchenko , Y . , 
Bouffard,G.G. , Blakesley , R . W . , Touchman, J , W . , Green, E.D., 
Dickson,M.C. , Rodriguez , A . C . , Grimwood,J., Schmutz,J., Myers, R.M., 
Butterf ield, Y.S. , Krzywinski , M. I . , Skalska,U., Smailus, D. E. , 
Schnerch,A., Schein,J,E., Jones, S.J. and Marra,M.A. 
Generation and initial analysis of more than 15,000 full-length 
human and mouse cDNA sequences 

Proc- Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) 

22388257 

12477932 

2 (bases 1 to 2172) 
Strausberg, R. 
Direct Submission 

Submitted (27-APR-2001) National Institutes of Health, Mammalian 
Gene Collection (MGC) , Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 
USA 

NIH-MGC Project URL: http://mgc.nci.nih.gov 
Contact: MGC help desk 
Email : cgapbs-r@mail .nih.gov 

Tissue Procurement: Lothar Hennighausen Ph.D., Chu-Xia Deng Ph.D. 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 



DNA Sequencing by: Baylor College of Medicine Human Genome 
Sequencing Center 
Center code: BCM-HGSC 

Web site: http://www.hgsc.bcm.tmc.edu/cdna/ 
Contact : amg@bcm . tmc . edu 

Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H., 
Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati, 
A.N., Gibbs, R.A. 



FEATURES 

source 



gene 



CDS 



misc feature 



ORIGIN 



Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E, Consort ium/LLNL at: http://image.llnl.gov 
Series: IRAK Plate: 5 Row: h Column: 19 

This clone was selected for full length sequencing because it 
passed the following selection criteria: Hexamer frequency ORF 
analysis. Similarity but not identity to protein. 

Location/Qualifiers 

1. .2172 

/organism="Mus musculus" 

/mol_type="mRNA" 

/strain^" 129, C57BL/6J, FVB/N" 

/db_xref ="taxon: 10090" 

/clone="MGC: 6839 IMAGE : 2649718 " 

/tissue_type= "Mammary tumor. Brcal-/fl; MMTV-Cre model. 10 

months old, gross tissue." 

/clone_l ib- "NCI_CGAP_Mam3 " 

/lab_host="DH10B" 

/not e=" Vector: pCMV-SP0RT6" 

1. .2172 

/gene="Stambp" 

/note=" synonym: Amsh" 

/ db_xr e f = " Locus I D : 7 0 5 2 7 " 

/db_xref="MGI : 1917777" 

107, .1381 

/codon_start=l 

/product =" St ambp protein" 

/protein__id="AAH06939 . 1" 

/db_xref ="GI : 13905280" 

/db_xref=" Locus ID: 70527" 

/translation="MSDHGDVSLPPQDRVRILSQLGSAVELNEDIPPRRYYRSGVEII 
RMASVYSEEGNI EHAFI LYNKYI TLF I EKLPKHRDYKSAI I PEKKDAVKKLKSVAFPK 
AEELKTELLRRYTKEYEQYKERKKKEEEELARNIAIQQELEKEKQRVAQQKQKQLEQE 
QFHAFEEMIQRQELEKERLKIVQEFGKVDPGPCGPLLPDLEKPCVDVAPSSPFSPTQT 
PDCNTGMRPAKPPWDRSLKPGALSVIENVPTIEGLRHIWPRNLCSEFLQLASANTA 
KGI ETCGVLCGKLMRNEFTITHVLI PRQNGGPDYCHTENEEEI FFMQDDLGLLTLGWI 
HTHPTQTAFLSSVDLHTHCSYQMMLPESIAIVCSPKFQETGFFKLTDYGLQEISTCRQ 
KGFHPHGRDPPLFCDCSHVTVKDRIVTITDLR" 
860. .1189 

/note="Mov34; Region: Mov34/MPN/PAD-l family. Members of 
this family are found in proteasome regulatory subunits, 
eukaryotic initiation factor 3 (eIF3) subunits and 
regulators of transcription factors. This family is also 
known as the MPN domain and PAD- 1-1 ike domain. It has been 
shown that this domain occurs in prokaryotes" 
/db_xref = " CDD :pfam01398" 



Query Match 



97.0%; Score 1343/ DB 10; Length 2172; 



Best Local Similarity 100,0%; Pred. No. 0; 

Matches 1343; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 42 AACTTATTTGTC(:M.TGTCTGACC7^TGGGGATGTGAGCCTCCCACCCa\AGACCGGGTGA 101 

i I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 1 I 1 1 I I I I I I I 
93 AACTTATTTGTCCAATGTCTGACCATGGGGATGTGAGCCTCCCACCCCAAGACCGGGTGA 152 

Qy 102 GGATTCTGTCCCAACTTGGGAGTGCAGTTGAGTTAAATGAAGACATTCCACCCCGTCGCT 161 

1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 

153 GGATTCTGTCCCAACTTGGGAGTGCAGTTGAGTTAAATGAAGACATTCCACCCCGTCGCT 212 
Qy 162 ACTACCGCTCCGGTGTTGAGATCATCCGCATGGCGTCCGTTTACTCGGAAGAAGGCAACA 221 

IMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIllMIIIMIIIIIIMIIIMMII 

Db 213 ACTACCGCTCCGGTGTTGAGATCATCCGCATGGCGTCCGTTTACTCGGAAGAAGGCAACA 272 

Qy 222 TTGAACATGCCTTTATCCTCTACAACAAGTACATCACGCTGTTTATTGAAAAACTTCCGA 281 

IMIMIIIIIIIIIII Ml lllllllllll ill III III III Mill Mill MIMII 

Db 273 TTGAACATGCCTTTATCCTCTACAACAAGTACATCACGCTGTTTATTGAAAAACTTCCGA 332 

Qy 282 AACACCGAGACTACAAATCAGCTATCATTCCTGAGAAGAAAGATGCTGTCAAGAAATTAA 341 

I I I I M I II II M I II I II II II I II II II I II I II II I II II II I II II II II II II II 
Db 333 AACACCGAGACTACAAATCAGCTATCATTCCTGAGAAGAAAGATGCTGTCAAGAAATTAA 392 

Qy 342 AGAGCGTCGCTTTCCCTAAAGCGGAAGAGCTGAAGACAGAGCTCTTGAGAAGATACACCA 4 01 

, 1 1 M M I II II I II 1 1 1 II I II II II II I II I II II I II I II II I II 1 1 1 II II II I II I 

Db 3 93 AGAGCGTCGCTTTCCCTAAAGCGGAAGAGCTGAAGACAGAGCTCTTGAGAAGATACACCA 452 

Qy 4 02 AAGAATATGAGCAGTATAAAGAGCGAAAGAAAAAGGAAGAAGAGGAACTTGCCCGAAATA 461 

IIIIIIIMMIIMMMIMMMMMMIIMMMMMMMMMMMMM 

Db 453 AAGAATATGAGCAGTATAAAGAGCGAAAGAAAAAGGAAGAAGAGGAACTTGCCCGAAATA 512 

Qy 462 TCGCCATCCAGCAAGAGTTGGAAAAAGAAAAACAGAGGGTTGCTCAGCAGAAGCAGAAGC 521 

1 1 1 1 1 1 1 1 1 1 1 M II I II I II I II I II 1 1 II II 1 1 1 II 1 1 M II II II I II II II I II 1 1 

513 TCGCCATCCAGCAAGAGTTGGAAAAAGAAAAACAGAGGGTTGCTCAGCAGAAGCAGAAGC 572 
Qy 522 AGCTAGAGCAGGAGCAATTCCATGCCTTTGAGGAGATGATCCAGAGGCAGGAGCTGGAAA 581 

„ -11 1 M I M II II I II I M I M I M I II 1 1 II II I II M I II II II II II I II I M 1 1 II I 

Db 573 AGCTAGAGCAGGAGCAATTCCATGCCTTTGAGGAGATGATCCAGAGGCAGGAGCTGGAAA 632 

Qy 582 AAGAACGGCTGAAAATTGTTCAAGAGTTCGGGAAGGTAGACCCTGGCCCCTGCGGGCCTC 641 

IINIIIIIIIIMIIilMIMMIMMMIMMMMMMMMMIMMMM 

633 AAGAACGGCTGAAAATTGTTCAAGAGTTCGGGAAGGTAGACCCTGGCCCCTGCGGGCCTC 692 
642 TGCTCCCTGATCTGGAAAAGCCTTGTGTAGATGTGGCCCCCAGCTCACCGTTCTCGCCCA 701 

^ NIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIMIIIIIIIi 

693 TGCTCCCTGATCTGGAAAAGCCTTGTGTAGATGTGGCCCCCAGCTCACCGTTCTCGCCCA 752 
702 CGCAGACTCCAGACTGTAACACAGGCATGAGGCCAGCTAAGCCACCTGTGGTGGACAGGT 761 

... IIIIIIINMMMIMIMIMMIIMMIIMIMMMIIIIMMMI^ 

753 CGCAGACTCCAGACTGTAACACAGGCATGAGGCCAGCTAAGCCACCTGTGGTGGACAGGT 812 
762 CCCTGAAACCTGGAGCGTTAAGCGTCATAGAAAATGTTCCCACCATTGAAGGCCTGCGCC 821 

MIIIMMMMIIMMIIIIMMMMMIMMIMMMMMMMMMIM 

813 CCCTGAAACCTGGAGCGTTAAGCGTCATAGAAAATGTTCCCACCATTGAAGGCCTGCGCC 872 
822 ACATCGTGGTGCCCCGTAATCTGTGCTCAGAATTTCTCCAGCTTGCCAGTGCCAATACCG 881 

lllilMIIMMIIIIIIMMIIMIIIIIIIIMIIMIIIIMIIIIIIMMIII 



Db 


873 


ACATCGTGGTGCCCCGTAATCTGTGCTCAGAATTTCTCCAGCTTGCCAGTGCCAATACCG 


932 


Qy 


882 


CCAAAGGCATTGAAACCTGTGGAGTCCTCTGTGGAAAACTGATGAGAAATGAATTCACAA 


941 


Db 


933 


IIIIMIIIIIIIIIIIIMMIIIMIIIIMIIIIIIIIIIIIMIIIIIIIIIIIII 

CCTWy^GGCATTGAAACCTGTGGAGTCCTCTGTGGAAAACTGATGAGAAATGAATTCACAA 


992 


Qy 


942 


TCACACATGTTCTCATCCCCAGACAAAATGGTGGGCCTGATTATTGCCACACGGAGAATG 

lllilllllMIIIIIIIIIIIIMIIIIIIIMIIIIillllllllllMIIIIIIIM 

TCACACATGTTCTCATCCCCAGACAAAATGGTGGGCCTGATTATTGCCACACGGAGAATG 


1001 


Db 


993 


1052 


Qy 


1002 


AAGAAGAAATTTTCTTTATGCAGGATGACCTTGGACTCCTCACTCTTGGCTGGATCCATA 

lllllllllllllllllllllllllillllllllllllllllllllllllllllllllll 

AAGAAGAAATTTTCTTTATGCAGGATGACCTTGGACTCCTCACTCTTGGCTGGATCCATA 


1061 


Db 


1053 


1112 


Qy 


1062 


CTCATCCAACCCAAACGGCCTTTCTGTCCAGTGTGGATCTCCACACTCACTGCTCCTACC 

IIIIIMIIIIIII.IIIIIIIMIMIIIIIIIIIIIIIMIIIIIMIIIIIIIIIIII 

CTCATCCAACCCAAACGGCCTTTCTGTCCAGTGTGGATCTCCACACTCACTGCTCCTACC 


1121 


Db 


1113 


1172 


Qy 


1122 


AAATGATGTTACCAGAGTCCATCGCAATCGTCTGTTCCCCA?^GTTCCAGGAAACTGGAT 


1181 


Db 


1173 


llllllilllllllllMIIMIIIIIMIIIIIIIIIIIIIIIIMIIIIIIIIIIMI 

AAATGATGTTACCAGAGTCCATCGCAATCGTCTGTTCCCCAAAGTTCCAGGAAACTGGAT 


1232 


Qy 


1182 


TCTTTAAGCTAACTGACTATGGTCTTCAAGAGATTTCAACCTGCCGGCAGAAAGGCTTTC 

IIIIIIIIMIIIIIIIIIIIMMIIIMIIIIIIIIIIIIIIIIIIIIIIMMIIII 

TCTTTAAGCTAACTGACTATGGTCTTCAAGAGATTTCAACCTGCCGGCAGAAAGGCTTTC 


1241 


Db 


1233 


1292 


Qy 


1242 


ACCCCCATGGCAGAGACCCACCGCTGTTCTGTGACTGCAGCCATGTCACTGTCAAGGACA 

IIIIIIIIIIMIIIIIIIIIIMIMIIIIMIIMIIMIIIIIIIIIIIIMIIIII 

ACCCCCATGGCAGAGACCCACCGCTGTTCTGTGACTGCAGCCATGTCACTGTCAAGGACA 


1301 


Db 


1293 


1352 


wy 

Db 


1353 


UAA. i i Li i LiALUA i LALAbALL i i LCjA i AAATLTCAAATCATGAACCAGGGAGATGGATCA 

IIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

GAATTGTGACGATCACAGACCTTCGATAAATCTCAAATCATGAACCAGGGAGATGGATCA 


1361 
1412 


Qy 


1362 


CTGGGTAACAGCACTTGTCACCA 1384 

lllllllllllllllllllllll 
CTGGGTAACAGCACTTGTCACCA 1435 




Db 


1413 





RESULT 6 
AY083159 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 

FEATURES 



AY083159 1544 bp mRNA linear ROD 26-MAR-2002 

Rattus norvegicus AMSH mRNA, complete cds . 

AY083159 

AY083159 .1 GI : 19743767 

Rattus norvegicus (Norway rat) 
Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 
Rattus. 

1 (bases 1 to 1544) 
Pawlak,A, and Guellaen,G, 

Direct Submission 

Submitted (08 -MAR-2002) U99, INSERM, Hopital Henri Mondor, Creteil 
94010, France 

Location/Qualifiers 



source 1. .1544 

/organism="Rattus norvegicus" 

/mol_type=="mRNA" 

/clb_xref = "taxon: 10116" 
CDS ' 51. .1325 

/ codon_s t art = 1 

/product -"AMSH" 

/protein_id="AAL92520 . 1" 

/db_xref ="GI : 19743768" 

/ t ransla t ion= " MSDHADVSLPPQDRVRI LSQLQSAVELNEDI PPRRYFRSGVEII 
RMASI YSEEGNIEHAFILYNKYITLFIEKLPKHRDYKSAI I PEKKDAVKKLKNVAFPK 
AEELKTELLKRYTKEYEQYKERKKKEEEELARNIAIQQELEKEKQRVAQQKQKQLEQE 
QFHAFEKMIQKQELEKERLKIVQEFGKVDPGPCGPLiLPDLEKPCVDVAPSSPFSPTQT 
SDCNTTLRPAKPPWDRSLKPGALSVIENVPTIEGLRHIWPRNLCSEFLQLASANTA 
KGI ETCGVLCGKLMRNEFTITHVLI PRQNGGPDYCHTENEEEI FFMQDDLGLLTLGWI 
HTHPTQTAFLSSVDLHTHCSYQMMLPESIAIVCSPKFQETGFFKLTDYGLQEISTCRQ 
KGFHPHGRDPPLFCDCSHVTVKDRI VTI TDLR " 

ORIGIN 



Query Match 87.3%; Score 1207.6; DB 10; Length 1544; 

Best Local Similarity 93.7%; Pred. No. 0; 

Matches 1280; Conservative 0; Mismatches 84; Indels 2; Gaps 2; 



Qy 


19 


CTGACTGTCATCCTTCACGAAAGAACTTATTTGTCCAATGTCTGACCATGGGGATGTGAG 


78 


Db 


14 


1 MIIIMIIIIMM IIMIIIII II IIIIIMIIIIIIIIIM IIIIMM 

CCGACTGTCATCCTTCAGGAAAGAACTCATCGGTCCAATGTCTGACCATGCAGATGTGAG 


73 


Qy 


79 


CCTCCCACCCCAAGACCGGGTGAGGATTCTGTCCCAACTTGGGAGTnrAfJTTnAnTTAAA 

MIIMIIIIIIIIIIIIIIIIIIMIIMMI Mill II lllllllllllllllll 

CCTCCCACCCCAAGACCGGGTGAGGATTCTGTCGCAACTCGGTAGTGCAGTTGAGTTAAA 


1 '^ft 
±o o 


Db 


74 


133 


Qy 


13 9 


TGAAGACATTCCACCCCGTCGCTACTACCGCTCCGGTGTTGAGATCATCCGCATGGCGTC 

IIMIIIIIMI lllllllllllll II IIIMIIIIIIIIIIIIIIIMIIII II 

TGAAGACATTCCGCCCCGTCGCTACTTTCGTTCCGGTGTTGAGATCATCCGCATGGCATC 


198 


Db 


134 


193 


Qy 


199 


CGTTTACTCGGAAGAAGGCAACATTGAACATGCCTTTATCCTCTACAACAAGTACATCAC 

1 lllllll llllllllllllllllllllllllllllllllllllllllllllllllll 
CATTTACTCTGAAGAAGGCAACATTGAACATGCCTTTATCCTCTACA?^CAAGTACATCAC 


258 


Db 


194 


253 


Qy 


259 


GCTGTTTATTGAAAAACTTCCGAAACACCGAGACTACAAATCAGCTATCATTCCTGAGAA 

IIIIMIIIIIIIIIIIIIII IIIIIMIIMIIIIIIIII II llllllll Mill 

GCTGTTTATTGAAAAACTTCCAAAACACCGAGACTACAAATCGGCCATCATTCCCGAGAA 


318 


Db 


254 


313 


Qy 


319 


GAAAGATGCTGTCA?i.GAAATTAAAGAGCGTCGCTTTCCCTAAAGCGGAAGAGCTGAAGAC 

MMIIII IIIIIIMIIIIIIII IIIIIIIIIIIIIIIIIIIIMMIIIIIMI 

GAAAGACGCGGTCAAGAAATTAAAGAATGTCGCTTTCCCTAAAGCGGAAGAGCTGAAGAC 


378 


Db 


314 


373 


Qy 


379 


AGAGCTCTTGAGAAGATACACCAAAGAATATGAGCAGTATAAAGAGCGAAAGAAAAAGGA 


438 


Db 


374 


Ml IMIMI IIIIIIIIIIIMI MIIIMMIMII Mlllllllll Mill 

AGAACTCTTGAAGAGATACACCAAAGAGTATGAGCAGTATAAGGAGCGAAAGAAGAAGGA 


433 


Qy 


439 


AGAAGAGGAACTTGCCCGAAATATCGCCATCCAGCAAGAGTTGGAAAAAGAAAAACAGAG 

M MIIIM 1 MM 11 IIIMII MIIIMMIMII 11 lllllllllllll Mill 

AGAAGAGGAACTTGCCCGAAATATCGCCATCCAGCAAGAACTGGAAAAAGAAAAGCAGAG 


498 


Db 


434 


493 


Qy 


499 


GGTTGCTCAGCAGAAGCAGAAGCAGCTAGAGCAGGAGCAATTCCATGCCTTTGAGGAGAT 

Mill IIIIIIIIIMIMIIMII IMIIMMM MMIMMMIIM MM 


558 



Db • 


494 


AGTTGCACAGCAGAAGCAGAAG(^GCTCGAG(>GGAGCAGTTCCATGCCTTTGAGAAGAT 


553 


Qy 


559 


GATCCAGAGGCAGGAGCTGGAAAAAGAACGGCTGAAAATTGTTCAAGAGTTCGGGAAGGT 

nil nil MM Mill M Mill Mill II M 1 1 II 1 1 II 1 M M II II 1 II II 

GATCCAGAAGCAGGAGCTAGAGAAAGAGCGGCTAAAAATTGTTCAAGAGTTCGGGAAGGT 


618 


Db 


554 


613 


Qy 


619 


AGACCCTGGCCCCTGCGGGCCTCTGCTCCCTGATCTGGAAAAGCCTTGTGTAGATGTGGC 

IMMMMMI IIIIIIIIIIIIIIIMIMIIIIMIIIIII MIMIIIMIIII 

AGACCCTGGCCCGTGCGGGCCTCTGCTCCCTGATCTGGAAAAGCCCTGTGTAGATGTGGC 


678 


Db 


614 


673 


Qy 


679 


CCCCAGCTCACCGTTCTCGCCCACGCAGACTCCAGACTGTAACACAGGCATGAGGCCAGC 

MIMI Mill IIIMIIIIIIIIIIIII llllllllllllll 1 llllllllll 

CCCCAGTTCACCTTTCTCGCCCACGCAGACTTCAGACTGTAACACAACCCTGAGGCCAGC 


738 


Db 


674 


733 


Qy 


739 


TAAGCCACCTGTGGTGGACAGGTCCCTGAAACCTGGAGCGTTAAGCGTCATAGAAAATGT 

MMIMMMMMMIMMMMI IIIIIIMIII IIIIIIIIIIMIIIIIIII 

TAAGCCACCTGTGGTGGACAGGTCCCTCAAACCTGGAGCATTAAGCGTCATAGAAAATGT 


798 


Db 


734 


793 


Qy 


799 


TCCCACCATTGAAGGCCTGCGCCACATCGTGGTGCCCCGTAATCTGTGCTCAGAATTTCT 

MIIMIIIIIIIIIIMIIIIMIII lllllllllll IIIIIIIIIIIIIIIIIMI 

TCCCACCATTGAAGGCCTGCGCCACATTGTGGTGCCCCGCAATCTGTGCTCAGAATTTCT 


858 


Db 


794 


853 


Qy 


859 


CCAGCTTGCCAGTGCCAATACCGCCAAAGGCATTGAAACCTGTGGAGTCCTCTGTGGAAA 

MM-IIIIIIM MM! II lllllllllll II Mill MIMI III MIMI III 

CCAGCTTGCCAGCGCCAACACTGCCA?iAGGCATCGAGACCTGTGGAGTCCTCTGTGGAAA 


918 


Db 


854 


913 


Qy 


919 


ACTGATGAGAAATGAATTCACAATCACACATGTTCTCATCCCCAGACAAAATGGTGGGCC 

Mllllllllllllllllllllllllllll llllllllllllll MIMI IIIIIIIMI 

ACTGATGAGAAATGAATTCACAATCACACATGTTCTCATCCCCAGACAAAATGGTGGGCC 


978 


Db 


914 


973 


Qy 


979 


TGATTATTGCCACACGGAGAATGAAGAAGAAATTTTCTTTATGCAGGATGACCTTGGACT 

MIMMIIIIIill Mill III MM II MIMI II MM II Mill II II II II II 

TGATTATTGCCACACAGAGAATGAAGAAGAAATTTTCTTTATGCAGGATGATCTTGGACT 


1038 


Db 


974 


1033 


Qy 


1039 


CCTCACTCTTGGCTGGATCCATACTCATCCAACCCAAACGGCCTTTCTGTCCAGTGTGGA 

MIMIIIIIIIIIIIIIIII II MIMMMMMMIMIMMMMMMMM 

CCTCACTCTTGGCTGGATCCACACCCATCCAACCCAAACGGCCTTTCTGTCCAGTGTGGA 


1098 


Db 


1034 


1093 


Qy 


1099 


TCTCOVCACTCACTGCTCCTACCAAATGATGTTACCAGAGTCCATCGCAATCGTCTGTTC 

MJMMII Mill MIMI II MM III III III II MM III Mill Mill II 

TCTGCACACGCACTGCTCCTACCAAATGATGTTACCAGAGTCCATAGCAATTGTCTGCTC 


1158 


Db 


1094 


1153 


Qy 


1159 


CCCAAAGTTCCAGGAAACTGGATTCTTTAAGCTAACTGACTATGGTCTTCAAGAGATTTC 

Ml IMIIIIIIII Mllllllllllll IMMMIMMI IIIMIMIIIIII 

CCCCAAGTTCCAGGAGACTGGATTCTTTAAATTAACTGACTATGGCCTTCAAGAGATTTC 


1218 


Db 


1154 


1213 


Qy 


1219 


AACCTGCCGGCAGAAAGGCTTTCACCCCCATGGCAGAGACCCACCGCTGTTCTGTGACTG 

IIMMMIIMMIIMMMIIIIIIIIIIIMIIIIIIIMIIIIIIMIIIIMII 

AACCTGCCGGCAGAAAGGCTTTCACCCCCATGGCAGAGACCCACCGCTGTTCTGTGACTG 


1278 


Db 


1214 


1273 


Qy 


1279 


CAGCCATGTCACTGTCAAGGAnAnAATTnTnA CdhTrL, mrn rpT'T'pr' atq a a ^'n-r-r'A a a 

MMMIIIIIIIMIII IIMIIIIIIIMIIIII MIMI Mllllllllllll 1 

CAGCCATGTCACTGTCAAAGACAGAATTGTGACGATCACAGACCTTCGATAAATCTC-AG 


1338 


Db 


1274 


1332 


Qy 


1339 


TCATGAACCAGGGAGATGGATCACTGGGTAACAGCACTTGTCACCA 1384 

MIIMIMI MM III MMIIIIII llllll Mill 

TCATGAACCA-GGAGGTGGCCCACTGGGTAAACACACTTGCCACCA 1377 




Db 


1333 





